Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
I have a directory of files which I iterate through and output to 1 master file. I would like to sort all the rows in a particular order before outputting to a file but because I'm iterating through multiple files my output is automatically separated into batches based on the flow of rows through my job.
Is there a way to sort all the rows without having to read in the master file?
Hi,
tFileList --> tFileInputDelimited --> tHashOutput
|
+ on Subjob OK
|
tHashInput --> tSortRow --> tFileOutputDelimited (sorted with all records)
Hope this helps.
Hi,
tFileList --> tFileInputDelimited --> tHashOutput
|
+ on Subjob OK
|
tHashInput --> tSortRow --> tFileOutputDelimited (sorted with all records)
Hope this helps.
In this case would tFileList trigger SubJob Ok when all files have been read or after each file was read?
The subjob starts to work only when all files are read.
"on Subjob OK" means "when the subjob is finish with success", so in this case when all the files have been read.
Hello,
Here is a document about https://community.talend.com/t5/Design-and-Development/What-is-the-difference-between-OnSubjobOK-and...
Hope it will be helpful.
Best regards
Sabrina