I need some help devising a strategy to parse JSON docs within a Talend job (Java job, not Perl). I am using Talend 5.0.2, developing on a Mac and planning to run on a Linux box.
Unfortunately, I cannot use the tFileInputJSON component because of the format of my files: each file contains several hundred JSON docs, with each complete JSON doc taking up exactly one line. I think the right solution is to read the file line by line, pass each line to a JSON parser, and send the parsed results on to the rest of the job.
As I see it my options are:
a) Send each input line to some sort of Java JSON parser. If that's the strategy I need to take, I'd like some advice on how to handle the parser's output and pass it into my tMap and other parts of the job.
b) Find a Talend component that parses JSON docs but doesn't require the input file as a whole to be a single valid JSON document.
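To make option (a) concrete, here is a rough sketch of what I have in mind, written as plain Java outside of Talend. Everything in it is my own assumption rather than a Talend API: the class name JsonLinesSketch, the methods parseLines and parseFlatObject, and the sample fields "id" and "name" are all hypothetical. The regex-based parseFlatObject is only a stand-in that handles flat objects with simple string or number values; in a real job I would expect to swap in a proper JSON library there.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class JsonLinesSketch {

    // Stand-in for a real JSON parser (assumption): matches "key": "value"
    // or "key": number pairs in a flat object. It does not handle nested
    // objects, arrays, booleans, nulls, or escaped characters.
    private static final Pattern FIELD = Pattern.compile(
        "\"([^\"\\\\]*)\"\\s*:\\s*(?:\"([^\"\\\\]*)\"|(-?\\d+(?:\\.\\d+)?))");

    // Turns one line (one JSON doc) into an ordered field-name -> value map.
    public static Map<String, String> parseFlatObject(String line) {
        Map<String, String> row = new LinkedHashMap<String, String>();
        Matcher m = FIELD.matcher(line);
        while (m.find()) {
            // Group 2 is set for string values, group 3 for numeric values.
            String value = m.group(2) != null ? m.group(2) : m.group(3);
            row.put(m.group(1), value);
        }
        return row;
    }

    // Reads the input line by line and parses each non-blank line
    // as an independent JSON doc.
    public static List<Map<String, String>> parseLines(BufferedReader reader)
            throws IOException {
        List<Map<String, String>> rows = new ArrayList<Map<String, String>>();
        String line;
        while ((line = reader.readLine()) != null) {
            if (line.trim().isEmpty()) {
                continue; // skip blank lines between docs
            }
            rows.add(parseFlatObject(line));
        }
        return rows;
    }

    public static void main(String[] args) throws IOException {
        // Two hypothetical docs, one per line, as in my input files.
        String sample = "{\"id\": 1, \"name\": \"alpha\"}\n"
                      + "{\"id\": 2, \"name\": \"beta\"}\n";
        BufferedReader reader = new BufferedReader(new StringReader(sample));
        for (Map<String, String> row : parseLines(reader)) {
            // Each map would correspond to one row flowing into the tMap.
            System.out.println(row.get("id") + " | " + row.get("name"));
        }
    }
}
```

What I don't know is how to get from a map like this to typed columns on a Talend row, which is the part I'd most like advice on.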
I've searched around for this component but can't seem to find it. From my search, it seems even the tFileInputJSON component is relatively new.
Anyone have some advice on where I should turn next?
Thanks in advance.