Hi,
I am using Talend Enterprise Data Integration Version: 5.5.1 Build id: r118616 and wanted to process a delimited file in parallel. However, the enable parallel execution for the component tFileInputDelimited is greyed out. Is there anything that I need to do to so that I can enable this option?
Regards.
Allan
Hi,
So far, enable parallel execution feature is not available in 5.5.1.
Could you please take a look at component
TalendHelpCenter:tParallelize which allows you to synchronize the execution of a subjob with the execution of other subjobs in your main Job.(It is available in Talend Enterprise Subscription Version).
Best regards
Sabrina
Hi Sabrina, Is that enabled in other versions? If so which versions is it enabled. As much as I would wanted to use tParallelize, I have to admit that I am quite new to Talend and would rather not manually do threading if I have the choice. Thanks and warm regards. Allan
Hi jholman,
Apologies, it seems I am not allowed to post images or URLs. I know you will be able to make sense of the below.
//www.talendforge.org/forum/img/members/243775/mini_enable_parallel_greyed_out.png
Regards.
Allan
Apologies, here is the full sized image. The checkbox is in the bottom and highlighted in yellow.
//www.talendforge.org/forum/img/members/243775/enable_parallel_greyed_out.png
Regards.
Allan
Hi,
So far, we don't support the function "Enable parallel execution" in advanced setting of tfileinputdelimited in Talend.
Could you please give us more description about your job requirment? Is there any problem when you use tParallelize? Do you want to use multi thread execution?
Best regards
Sabrina
Hi Sabrina, We require processing of very large files (trades, orders, etc.) and I want to have multi threaded specific data flows in the job (i.e. enrichment, etc.). So unless I am missing something, tParallelize is not really a solution for me. What I am looking for is something like a: tFileInputDelimited -> tPartitioner -> tCollector -> (some random transformation component like tMap) -> tDepartitioner -> tRecollector -> (some additional transformation) -> (load to database) I have read in one of Talend's articles (//help.talend.com/display/KB/How+to+automatically+enable+parallelization+of+data+flows+for+better+performance) that this is supposed to be possible. Regards. Allan
The last time I looked the tFileInput components use a blocking i/o library in Java so having multiple threads read the file is not really possible. You can always just split your files and the have multiple readers read each chunk in parallel. I'm not exactly how to implement what your asking for but I will try to summon RBaldwin to this thread, if any one can answer he can since he wrote all the MPP components.