<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to optimize tSortRow with 10M rows ? in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/How-to-optimize-tSortRow-with-10M-rows/m-p/2236838#M25451</link>
    <description>In your tSQLinput query add : order by &amp;lt;column&amp;gt;
&lt;BR /&gt;
&lt;BR /&gt;Alternatively, it sounds like a load once to this table? say from a multiple file source, first store them in smaller fragments, write output based on some logic... like a file for each week of the year / data you want to sort by. Then process these smaller files and sort them before writing to db.
&lt;BR /&gt;
&lt;BR /&gt;Alternatively, write to a tmp table, and next write a tsql : insert into finaltable as select... from tmptable order by your columns.
&lt;BR /&gt;
&lt;BR /&gt;</description>
    <pubDate>Fri, 24 Aug 2018 10:49:40 GMT</pubDate>
    <dc:creator>Jesperrekuh</dc:creator>
    <dc:date>2018-08-24T10:49:40Z</dc:date>
    <item>
      <title>How to optimize tSortRow with 10M rows ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-optimize-tSortRow-with-10M-rows/m-p/2236837#M25450</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class=""&gt;Do you have tips to optimize a 10 million lines processing with a tSortRow before inserting it into the database?&lt;BR /&gt;&lt;BR /&gt;I have good performance at the beginning (~ 6600rows / s), the more the number of treated lines increases, the more the performances decrease. Arrived at 600 000 lines, I have the error &lt;FONT face="courier new,courier"&gt;OutOfMemoryError: GC overhead limit exceeded&lt;/FONT&gt; (I could increase the memory of the JVM for the job, but I think it's not optimal)&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class=""&gt;Thanks.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 24 Aug 2018 08:52:11 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-optimize-tSortRow-with-10M-rows/m-p/2236837#M25450</guid>
      <dc:creator>Mayot</dc:creator>
      <dc:date>2018-08-24T08:52:11Z</dc:date>
    </item>
    <item>
      <title>Re: How to optimize tSortRow with 10M rows ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-optimize-tSortRow-with-10M-rows/m-p/2236838#M25451</link>
      <description>In your tSQLinput query add : order by &amp;lt;column&amp;gt;
&lt;BR /&gt;
&lt;BR /&gt;Alternatively, it sounds like a load once to this table? say from a multiple file source, first store them in smaller fragments, write output based on some logic... like a file for each week of the year / data you want to sort by. Then process these smaller files and sort them before writing to db.
&lt;BR /&gt;
&lt;BR /&gt;Alternatively, write to a tmp table, and next write a tsql : insert into finaltable as select... from tmptable order by your columns.
&lt;BR /&gt;
&lt;BR /&gt;</description>
      <pubDate>Fri, 24 Aug 2018 10:49:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-optimize-tSortRow-with-10M-rows/m-p/2236838#M25451</guid>
      <dc:creator>Jesperrekuh</dc:creator>
      <dc:date>2018-08-24T10:49:40Z</dc:date>
    </item>
    <item>
      <title>Re: How to optimize tSortRow with 10M rows ?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-optimize-tSortRow-with-10M-rows/m-p/2236839#M25452</link>
      <description>Thanks, i will try this</description>
      <pubDate>Fri, 24 Aug 2018 13:34:25 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-optimize-tSortRow-with-10M-rows/m-p/2236839#M25452</guid>
      <dc:creator>Mayot</dc:creator>
      <dc:date>2018-08-24T13:34:25Z</dc:date>
    </item>
  </channel>
</rss>

