Skip to main content
Announcements
Join us at Qlik Connect for 3 magical days of learning, networking,and inspiration! REGISTER TODAY and save!
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

[resolved] parallelization In talend

Hi,
I tested above test case(Reading from excel Sorting and writing into file mentioned in 
https://help.talend.com/search/all?query=How+to+automatically+enable+parallelization+of+data+flows+f... article ) and results are as follows,
My Configuration is :
i3 processor (4 logical cores)
4GB RAM
<
   
Test Details Time taken Single Thread(sec) Time taken 3 - Thread (Sec) Time taken 2 - Thread (Sec) Rows
Reading from excel and writing to db 25 29  26 190853 Rows
Reading from excel and writing to File 3 15  5 190853 Rows
Reading from excel and writing to db 58 59 59 381706 Rows
Reading from excel Sorting and writing into file 16 81 21 381706 Rows
Reading from excel Sorting and writing into file 8 9 8 190853 Rows
 
and it seems by enabling parallelization Jobs are actually getting slower. Then what is use of  parallelization ???
 
please explain.
 
Thanks,
Pankaj
0683p000009MB7t.png
Labels (2)
11 Replies
Anonymous
Not applicable
Author

Hi  napsterpp,
Have you  already subscribed to one of the Talend Platform solutions?
Could you please report a ticket on Talend Support Portal so that we can give you a remote assistance to see if there is something wrong with your setting in current work flow. https://support.talend.com/otrs/customer.pl
Best regards
Sabrina
Anonymous
Not applicable
Author

Hi,
I am evaluating trail version provided by your sales team as part of POC.
Talend Platform for Data Management : 5.6.2
So,Could you help me out.
Thanks,
Pankaj
Anonymous
Not applicable
Author

My Job Design:

0683p000009MB7y.png 0683p000009M9YA.png
0683p000009MAwc.png
0683p000009MB3P.png
Anonymous
Not applicable
Author

Hi Pankaj,
Sabrina will look into this but also let me the Customer Success Management team on the topic (as we may need to look into your configuration more in depth).
Elisa
Anonymous
Not applicable
Author

Hi esabot,
Thanks you for your reply.
My System Configuration is
i3 processor (4 logical cores)
4GB RAM .
and I provided JOB design.
What more information do you need.
Regards,
Pankaj
Anonymous
Not applicable
Author

Hi,
Finally I was able to get desired results. all I have to do is disable departition row.
I skipped below steps. (I disabled them manually)


Departitioning ( Recollecting (

Thanks,
Pankaj
Anonymous
Not applicable
Author

Hi, I am trying to solve an performance issue around sorting huge file(50 Million record) to be sorted on Integer column+Alpha column(file has 6 columns). tSort takes around 30 mins with enabling sort on disk .
I am using TOS 5.6.2 and evaluating this sort for my POC . Please advise and the job design provided on this discussion has parallelision tab which I dont have to try it out.
Anonymous
Not applicable
Author

Hi RRaj
The parallel executions feature is not available in Talend Open Source. What's the rate of your job?
Best regards
Sabrina
Anonymous
Not applicable
Author

Thanks for quick response. can u brief on rate on what exactly you are looking for .
basically my regular tsort enabling tsort on disk in expensive taking 30 mins to process 26 Million records, so wanted to check if there are any perf tuning measures within TOS can be leveraged