<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: tSortRow runs very slow with EXCEL Input in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279455#M54615</link>
    <description>Hi Sean,
&lt;BR /&gt;Have you tried with tExternalSortRow ? This component writes the incoming data flow to a temporary file before sorting it with gnu sort. It avoids data bufferization and thus reduces memory consumption.
&lt;BR /&gt;Hope it helps.
&lt;BR /&gt;Richard</description>
    <pubDate>Sat, 22 Mar 2008 09:00:43 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2008-03-22T09:00:43Z</dc:date>
    <item>
      <title>tSortRow runs very slow with EXCEL Input</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279453#M54613</link>
      <description>Hi! 
&lt;BR /&gt;I am trying to sort about 90K rows coming from a multisheet excel file. If I use Excel Input and Sort (I am sorting on 4 keys out of about 10 columns), tSortRow runs very slow (2 rows/sec). If I remove the excel file and put a CSV file created from the same excel, the performance improves many folds. 
&lt;BR /&gt;A workaround to this is, of course, to convert this XL file to CSV and then use that as an input. I am just trying to figure out why the response for tSortRow be different for different inputs. 
&lt;BR /&gt;Note that I am using the new XL enhancement that allows me to read multiple worksheets without using tUnite. 
&lt;BR /&gt;Thanks. 
&lt;BR /&gt;Regards, 
&lt;BR /&gt;Sean</description>
      <pubDate>Sat, 16 Nov 2024 14:25:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279453#M54613</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T14:25:15Z</dc:date>
    </item>
    <item>
      <title>Re: tSortRow runs very slow with EXCEL Input</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279454#M54614</link>
      <description>Additional information for the above query: &lt;BR /&gt;I tried running a job that extracted XL to the CSV and in a sub job (on job OK), I read from the same CSV that was just created. The result is the same low sorter speed. When I just use the CSV file in a standalone job, I get very fast speeds. &lt;BR /&gt;My workaround right now is to have one job to create the CSV and another one to read the CSV and do the processing. I am trying to keep everything to one PERL script but this will result into at least 2 scripts. &lt;BR /&gt;Thanks&lt;BR /&gt;Sean</description>
      <pubDate>Sat, 22 Mar 2008 04:58:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279454#M54614</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2008-03-22T04:58:27Z</dc:date>
    </item>
    <item>
      <title>Re: tSortRow runs very slow with EXCEL Input</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279455#M54615</link>
      <description>Hi Sean,
&lt;BR /&gt;Have you tried with tExternalSortRow ? This component writes the incoming data flow to a temporary file before sorting it with gnu sort. It avoids data bufferization and thus reduces memory consumption.
&lt;BR /&gt;Hope it helps.
&lt;BR /&gt;Richard</description>
      <pubDate>Sat, 22 Mar 2008 09:00:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279455#M54615</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2008-03-22T09:00:43Z</dc:date>
    </item>
    <item>
      <title>Re: tSortRow runs very slow with EXCEL Input</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279456#M54616</link>
      <description>I think there is no other problem that the way the "real time statistics" performance rate (rows/s above each row link) is calculated. Don't take it into account, just read the total execution time and you'll see the input doesn't affect tSortRow performances.
&lt;BR /&gt;Talend Open Studio generated code model was designed so that components are independant.</description>
      <pubDate>Sat, 22 Mar 2008 14:37:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279456#M54616</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2008-03-22T14:37:26Z</dc:date>
    </item>
    <item>
      <title>Re: tSortRow runs very slow with EXCEL Input</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279457#M54617</link>
      <description>Well. You are right. I am doing something wrong to slow this job down. When I created a simple job with EXCEL input, a sort and then the CSV output, it was fast. So I need to look more as to why that particular job is slowing down. I'll get back to you on that. 
&lt;BR /&gt;Regards,
&lt;BR /&gt;Sean</description>
      <pubDate>Sat, 22 Mar 2008 15:22:25 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tSortRow-runs-very-slow-with-EXCEL-Input/m-p/2279457#M54617</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2008-03-22T15:22:25Z</dc:date>
    </item>
  </channel>
</rss>

