Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Join us in Bucharest on Sept 18th for Qlik's AI Reality Tour! Register Now
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Parse file containing multiple JSON documents

I need some help devising a strategy to parse JSON docs within a talend job (Java job, not Perl). I am using Talend Version: 5.0.2 and developing on a Mac, planning to run on a Linux box.
Unfortunately, I cannot use the tFileInputJSON component because of the format of my files -- each file contains several hundred JSON docs, with a complete JSON doc taking up one line in the file. I think the right solution is to read the file line by line then pass it into a JSON parser and from there send the results to the rest of the job.
As I see it my options are:
a) send the line input to some sort of Java JSON parser. If that's the strategy I need to take, I'd like some advice on how to deal with the output and passing that output into my tmap/other parts of the job.
b) find a Talend component that parses JSON docs, but doesn't require an input of a file with a single valid JSON format.
I've searched around for this component but can't seem to find it. From my search, it seems even the tFileInputJSON component is relatively new.
Anyone have some advice on where I should turn next?
Thanks in advance.

This post closely mirrors a previous, unanswered post: https://community.talend.com/t5/Design-and-Development/Iterate-Through-A-Multi-Row-JSON-File/td-p/10...
Labels (5)
10 Replies
Anonymous
Not applicable
Author

@Moe @xdshi@gbadge Trying to solve similar problem, not able to see any images shared by Pedro, has anybody solved this problem yet?