<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Faster .csv read processing in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Faster-csv-read-processing/m-p/2375552#M137953</link>
    <description>&lt;P&gt;I have a job that reads data from a CSV file and then processes it, comparing it with the data in my database. At the moment the job processes 700 rows of the CSV file in 2 minutes, which is too slow for me because I have files with more than 20,000 rows. My job looks something like this:&lt;/P&gt;&lt;P&gt;tFileInputDelimited &amp;gt; tReplace &amp;gt; 2 DB components to add additional data to the flow &amp;gt; tFilterRow &amp;gt; tMap &amp;gt; DB INSERT&lt;/P&gt;&lt;P&gt;Does anybody know how to improve the performance of the job? When I read the file using only the tFileInputDelimited component, it reads the whole file in 3 seconds.&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 01:38:07 GMT</pubDate>
    <dc:creator>Tech8</dc:creator>
    <dc:date>2024-11-16T01:38:07Z</dc:date>
    <item>
      <title>Faster .csv read processing</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Faster-csv-read-processing/m-p/2375552#M137953</link>
      <description>&lt;P&gt;I have a job that reads data from a CSV file and then processes it, comparing it with the data in my database. At the moment the job processes 700 rows of the CSV file in 2 minutes, which is too slow for me because I have files with more than 20,000 rows. My job looks something like this:&lt;/P&gt;&lt;P&gt;tFileInputDelimited &amp;gt; tReplace &amp;gt; 2 DB components to add additional data to the flow &amp;gt; tFilterRow &amp;gt; tMap &amp;gt; DB INSERT&lt;/P&gt;&lt;P&gt;Does anybody know how to improve the performance of the job? When I read the file using only the tFileInputDelimited component, it reads the whole file in 3 seconds.&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 01:38:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Faster-csv-read-processing/m-p/2375552#M137953</guid>
      <dc:creator>Tech8</dc:creator>
      <dc:date>2024-11-16T01:38:07Z</dc:date>
    </item>
    <item>
      <title>Re: Faster .csv read processing</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Faster-csv-read-processing/m-p/2375553#M137954</link>
      <description>&lt;P&gt;@Tech Eight, maybe you can try the tDBBulk component to insert into the DB.&lt;/P&gt;&lt;P&gt;Check the links below.&lt;/P&gt;&lt;P&gt;&lt;A href="https://help.talend.com/reader/aMa3LeRerDnYLmJvEPq0bw/oFG_MwrzFFCtU1MJVK8Pdw" alt="https://help.talend.com/reader/aMa3LeRerDnYLmJvEPq0bw/oFG_MwrzFFCtU1MJVK8Pdw" target="_blank"&gt;https://help.talend.com/reader/aMa3LeRerDnYLmJvEPq0bw/oFG_MwrzFFCtU1MJVK8Pdw&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://help.talend.com/reader/tXRG~nTonRYUwbOJscDgxw/KS~ToADRI4boTFy9BN2GPA" alt="https://help.talend.com/reader/tXRG~nTonRYUwbOJscDgxw/KS~ToADRI4boTFy9BN2GPA" target="_blank"&gt;https://help.talend.com/reader/tXRG~nTonRYUwbOJscDgxw/KS~ToADRI4boTFy9BN2GPA&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Manohar&lt;/P&gt;</description>
      <pubDate>Thu, 20 Aug 2020 13:22:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Faster-csv-read-processing/m-p/2375553#M137954</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2020-08-20T13:22:07Z</dc:date>
    </item>
  </channel>
</rss>

