<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: sort data of large size in parallel in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332950#M101778</link>
    <description>&lt;P&gt;Hi Sabrina,&lt;/P&gt;&lt;P&gt;Thanks for your reply!. &lt;/P&gt;&lt;P&gt;I did use the disk and allocated more memory but the run is still slow&lt;/P&gt;&lt;P&gt;I am trying to implement the  parallelization described in the Talend Data integration guide link below&lt;/P&gt;&lt;P&gt;https://help.talend.com/r/9bBURCEt_t~lUHE3DeE2LA/kOE3mfYu2ConQETvTlcb0Q&lt;/P&gt;&lt;P&gt;in this guide there is bullet below about activating parallelization&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;"Right-click the start component of the Job,&amp;nbsp;&lt;B&gt;tFileInputDelimited&lt;/B&gt;&amp;nbsp;in the scenario, and from the contextual menu, select&amp;nbsp;&lt;B&gt;Set parallelization&lt;/B&gt;.Then the parallelization is automatically implemented."&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I do not have this option with the Talend i downloaded &lt;/P&gt;&lt;P&gt;you mentioned about "&amp;nbsp;talend subscription solution" &lt;/P&gt;&lt;P&gt;How can i subscribe?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;&lt;P&gt;-Chadi&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 24 Mar 2021 13:25:27 GMT</pubDate>
    <dc:creator>CChemali1616516071</dc:creator>
    <dc:date>2021-03-24T13:25:27Z</dc:date>
    <item>
      <title>sort data of large size in parallel</title>
      <link>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332948#M101776</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I am using Talend Data Integration Studio and looking to sort large data set using components&lt;/P&gt;&lt;P&gt;tFileInputPositional, tSortRow and tUniqRow&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The run is slow so i am exploring enabling parallelization but i cannot find the "Set &lt;/P&gt;&lt;P&gt;parallelization" option&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;How can i integrate this option in Talend Data Integration Studio?&lt;/P&gt;  &lt;P&gt; Thanks&lt;/P&gt;</description>
      <pubDate>Tue, 23 Mar 2021 16:25:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332948#M101776</guid>
      <dc:creator>CChemali1616516071</dc:creator>
      <dc:date>2021-03-23T16:25:12Z</dc:date>
    </item>
    <item>
      <title>Re: sort data of large size in parallel</title>
      <link>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332949#M101777</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;If you set the job running in parallel it means you can have different start points and they will start in different threads in parallel.&lt;/P&gt;&lt;P&gt;With talend subscription solution, you will have the component tParallel in which you can trigger multiple sub jobs in parallel and wait for the end of all of them.&lt;/P&gt;&lt;P&gt;In your job, there are some cache component consuming two much memory. such as tUniqRow and tSortRow. For a large set of data, try to store the data on disk instead of memory on tUniqRow and tSortRow. Also, allocate more memory to execute the job.&lt;/P&gt;&lt;P&gt;Best regards&lt;/P&gt;&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Wed, 24 Mar 2021 06:01:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332949#M101777</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-03-24T06:01:51Z</dc:date>
    </item>
    <item>
      <title>Re: sort data of large size in parallel</title>
      <link>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332950#M101778</link>
      <description>&lt;P&gt;Hi Sabrina,&lt;/P&gt;&lt;P&gt;Thanks for your reply!. &lt;/P&gt;&lt;P&gt;I did use the disk and allocated more memory but the run is still slow&lt;/P&gt;&lt;P&gt;I am trying to implement the  parallelization described in the Talend Data integration guide link below&lt;/P&gt;&lt;P&gt;https://help.talend.com/r/9bBURCEt_t~lUHE3DeE2LA/kOE3mfYu2ConQETvTlcb0Q&lt;/P&gt;&lt;P&gt;in this guide there is bullet below about activating parallelization&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;"Right-click the start component of the Job,&amp;nbsp;&lt;B&gt;tFileInputDelimited&lt;/B&gt;&amp;nbsp;in the scenario, and from the contextual menu, select&amp;nbsp;&lt;B&gt;Set parallelization&lt;/B&gt;.Then the parallelization is automatically implemented."&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I do not have this option with the Talend i downloaded &lt;/P&gt;&lt;P&gt;you mentioned about "&amp;nbsp;talend subscription solution" &lt;/P&gt;&lt;P&gt;How can i subscribe?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;&lt;P&gt;-Chadi&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 24 Mar 2021 13:25:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332950#M101778</guid>
      <dc:creator>CChemali1616516071</dc:creator>
      <dc:date>2021-03-24T13:25:27Z</dc:date>
    </item>
    <item>
      <title>Re: sort data of large size in parallel</title>
      <link>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332951#M101779</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;The Parallelization tab is available in talend subscription solution(paid version) not open source.&lt;/P&gt;&lt;P&gt;Best regards&lt;/P&gt;&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Thu, 25 Mar 2021 06:50:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332951#M101779</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-03-25T06:50:24Z</dc:date>
    </item>
    <item>
      <title>Re: sort data of large size in parallel</title>
      <link>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332952#M101780</link>
      <description>&lt;P&gt;I see. thanks !&lt;/P&gt;</description>
      <pubDate>Thu, 25 Mar 2021 13:14:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332952#M101780</guid>
      <dc:creator>CChemali1616516071</dc:creator>
      <dc:date>2021-03-25T13:14:50Z</dc:date>
    </item>
    <item>
      <title>Re: sort data of large size in parallel</title>
      <link>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332953#M101781</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;Feel free to let us know if there is any further help we can give.&lt;/P&gt;&lt;P&gt;Best regards&lt;/P&gt;&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Mon, 29 Mar 2021 06:36:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/sort-data-of-large-size-in-parallel/m-p/2332953#M101781</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-03-29T06:36:29Z</dc:date>
    </item>
  </channel>
</rss>

