Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik GA: Multivariate Time Series in Qlik Predict: Get Details
cancel
Showing results for 
Search instead for 
Did you mean: 
TomG1
Creator
Creator

Internal working of parallelization componets

I have used parallelization components in a talend job. I am confused how this internally works.

0683p000009LtlP.png

 

Here three threads are created and passed to the tSortRow component. I am partitioning on the basis of one column(lets say A) and sorting on the basis on another column(lets say B). How is the sorting done by tsortrow component. is tSortRow component combining all rows received from all threads and then doing sorting. Once sorting is done , how is the sorted data again passed as three threads to departitioner?

Labels (2)
4 Replies
TRF
Champion II
Champion II

TomG1
Creator
Creator
Author

@TRF

 

The talend help site has given only an overview of the different parallelization components.

I want to go a little detailed(details i asked in the first place) so that I can use these components in my job for performance increase.

 

Thanks

Anonymous
Not applicable

I believe that this will shed some light on the matter ..... https://help.talend.com/display/KB/How+to+automatically+enable+parallelization+of+data+flows+for+bet...

As a rule of thumb if you are concerned about efficiency with this then you want to look at the number of threads you will be using. This depends on the number of cores your machine has. If you are on a 4 core machine, use a max of 3 threads (I believe this is explained in the link).

Anonymous
Not applicable

OK, just realised I answered a different question. I guess this *may* be of some use, hence I have left it.

 

However, I have to ask why you are trying to make this more efficient by getting Java to sort your data? Do the sorting in the DB. Or is this just an experiment?