<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: tloop performance in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/tloop-performance/m-p/2319479#M89719</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In my opinion, reading 5 million records and loading them into a DB will not cause a memory problem.&lt;/P&gt;&lt;P&gt;You can simply use a bulk load component to load the data into your database, which is much faster than using a DB output component.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Andy&lt;/P&gt;</description>
    <pubDate>Tue, 19 Jan 2021 10:48:52 GMT</pubDate>
    <dc:creator>CLi1594691515</dc:creator>
    <dc:date>2021-01-19T10:48:52Z</dc:date>
    <item>
      <title>tloop performance</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tloop-performance/m-p/2319478#M89718</link>
      <description>&lt;P&gt;Hi Talend contributors,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My work requires a job that reads about 5 million records from a CSV file. To avoid running out of memory, I designed my job like this:&lt;/P&gt;&lt;P&gt;tFileRowCount =&amp;gt; tLoop (1000 records per iteration) =&amp;gt; tFileInputDelimited =&amp;gt; tDbOutput&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The idea is to avoid overloading memory by reading the CSV 1000 records at a time, once per loop iteration.&lt;/P&gt;&lt;P&gt;But I am not sure how tLoop manages memory.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So my question is: does tLoop automatically clear memory each time it starts a new iteration?&lt;/P&gt;&lt;P&gt;For example, as Java code:&lt;/P&gt;&lt;P&gt;for (int i = 1; i &amp;lt;= 5_000_000; i += 1000) {&lt;/P&gt;&lt;P&gt;   // read CSV records i to i + 999 and write them to the DB&lt;/P&gt;&lt;P&gt;}&lt;/P&gt;&lt;P&gt;=&amp;gt; In this case, the job only holds 1000 records in memory at any time, right?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 00:48:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tloop-performance/m-p/2319478#M89718</guid>
      <dc:creator>Blade</dc:creator>
      <dc:date>2024-11-16T00:48:01Z</dc:date>
    </item>
    <item>
      <title>Re: tloop performance</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tloop-performance/m-p/2319479#M89719</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In my opinion, reading 5 million records and loading them into a DB will not cause a memory problem.&lt;/P&gt;&lt;P&gt;You can simply use a bulk load component to load the data into your database, which is much faster than using a DB output component.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Andy&lt;/P&gt;</description>
      <pubDate>Tue, 19 Jan 2021 10:48:52 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tloop-performance/m-p/2319479#M89719</guid>
      <dc:creator>CLi1594691515</dc:creator>
      <dc:date>2021-01-19T10:48:52Z</dc:date>
    </item>
    <item>
      <title>Re: tloop performance</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tloop-performance/m-p/2319480#M89720</link>
      <description>&lt;P&gt;Hi @Chun Yin Li, thank you for the suggestion. 5 million is just an example &lt;span class="lia-unicode-emoji" title=":grinning_face_with_sweat:"&gt;😅&lt;/span&gt; Let's say we don't know how many input records we have, maybe 5 billion or 100 billion, so there should be a better way to handle very large data.&lt;/P&gt;</description>
      <pubDate>Tue, 19 Jan 2021 11:01:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tloop-performance/m-p/2319480#M89720</guid>
      <dc:creator>Blade</dc:creator>
      <dc:date>2021-01-19T11:01:33Z</dc:date>
    </item>
    <item>
      <title>Re: tloop performance</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tloop-performance/m-p/2319481#M89721</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regarding your concern, I think you could do the following:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Use tFileRowCount to check whether the row count exceeds a certain threshold. If it does, proceed to step 2; otherwise, load the rows into the DB directly.&lt;/LI&gt;&lt;LI&gt;tFileInput -&amp;gt; tFileOutput (with the "Split output in several files" option) to split the input into smaller files, then load them into the DB one by one.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;This works if storage is not a concern.&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2021 01:49:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tloop-performance/m-p/2319481#M89721</guid>
      <dc:creator>CLi1594691515</dc:creator>
      <dc:date>2021-01-20T01:49:44Z</dc:date>
    </item>
  </channel>
</rss>

