<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to use parallelization of data flows. in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/How-to-use-parallelization-of-data-flows/m-p/2360445#M125022</link>
    <description>Hi, 
&lt;BR /&gt;Based on your job description, i don't think that you have joins in tMap. What you are doing is filtering application... you can use tfilterrow component. One more thing is that you are not receiving memory error, you can also think ff Hash components to replicate data to two filter row components and write data independently to your output device. 
&lt;BR /&gt;Vaibhav</description>
    <pubDate>Mon, 15 Sep 2014 15:31:47 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2014-09-15T15:31:47Z</dc:date>
    <item>
      <title>How to use parallelization of data flows.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-use-parallelization-of-data-flows/m-p/2360444#M125021</link>
      <description>Hi, 
&lt;BR /&gt;I tried to test the new feature called "parallelization of data flows".&amp;nbsp; 
&lt;BR /&gt;Consequently, I created the Talend job below. 
&lt;BR /&gt;tFileInputDelimited with 5000000 rows, 
&lt;BR /&gt;| row1 
&lt;BR /&gt;tMap with a little transformation, 
&lt;BR /&gt;| row2 
&lt;BR /&gt;tFileOutputDelimited 
&lt;BR /&gt;For row1 : 
&lt;BR /&gt;« Nombre de process enfants » = 3 (I have 4 processors) 
&lt;BR /&gt;« QUEUE_SIZE » = 5000. 
&lt;BR /&gt;Pour row2 : 
&lt;BR /&gt;« Merge sort partitions » = yes 
&lt;BR /&gt;« QUEUE_SIZE de out1 » = 5000 
&lt;BR /&gt;Unfortunatly, my results are really bad : the new fonctionality increase the processing time by 10 seconds. 
&lt;BR /&gt;On the internet, I only saw examples of this feature with a tSortRow component.&amp;nbsp; 
&lt;BR /&gt;Could you tell me if it is possible to implement this new feature with other Talend components ? 
&lt;BR /&gt;Regards.</description>
      <pubDate>Mon, 15 Sep 2014 11:06:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-use-parallelization-of-data-flows/m-p/2360444#M125021</guid>
      <dc:creator>DerfelCadarn</dc:creator>
      <dc:date>2014-09-15T11:06:56Z</dc:date>
    </item>
    <item>
      <title>Re: How to use parallelization of data flows.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-use-parallelization-of-data-flows/m-p/2360445#M125022</link>
      <description>Hi, 
&lt;BR /&gt;Based on your job description, i don't think that you have joins in tMap. What you are doing is filtering application... you can use tfilterrow component. One more thing is that you are not receiving memory error, you can also think ff Hash components to replicate data to two filter row components and write data independently to your output device. 
&lt;BR /&gt;Vaibhav</description>
      <pubDate>Mon, 15 Sep 2014 15:31:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-use-parallelization-of-data-flows/m-p/2360445#M125022</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-09-15T15:31:47Z</dc:date>
    </item>
    <item>
      <title>Re: How to use parallelization of data flows.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-use-parallelization-of-data-flows/m-p/2360446#M125023</link>
      <description>hi all,&lt;BR /&gt;I guess that the purpose is to know if parallelization of data flows as explained in talend help is only to optimize tSortrow component or any other ones :&lt;BR /&gt;&lt;A href="https://help.talend.com/search/all?query=How+to+enable+parallelization+of+data+flows&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;https://help.talend.com/search/all?query=How+to+enable+parallelization+of+data+flows&amp;amp;content-lang=en&lt;/A&gt;&lt;BR /&gt;Seems that functionnality is an automation of tPartionner :&lt;BR /&gt;&lt;A href="https://help.talend.com/search/all?query=tPartitioner&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;https://help.talend.com/search/all?query=tPartitioner&amp;amp;content-lang=en&lt;/A&gt;&lt;BR /&gt;but I don't know if it can use on other component as asked ...&lt;BR /&gt;regards&lt;BR /&gt;laurent</description>
      <pubDate>Mon, 15 Sep 2014 15:50:34 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-use-parallelization-of-data-flows/m-p/2360446#M125023</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-09-15T15:50:34Z</dc:date>
    </item>
    <item>
      <title>Re: How to use parallelization of data flows.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-use-parallelization-of-data-flows/m-p/2360447#M125024</link>
      <description>Yes, that is exactly what I wanted to know :&amp;nbsp;
&lt;BR /&gt;- Is it possible to use this new feature with an other component than a tSortRow ?
&lt;BR /&gt;- Could you please give us some "use cases" ?
&lt;BR /&gt;Thank you in advance,
&lt;BR /&gt;Regards.</description>
      <pubDate>Mon, 15 Sep 2014 16:17:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-use-parallelization-of-data-flows/m-p/2360447#M125024</guid>
      <dc:creator>DerfelCadarn</dc:creator>
      <dc:date>2014-09-15T16:17:27Z</dc:date>
    </item>
  </channel>
</rss>

