<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Talend job to Insert 3 million data from positional file to staging table in SQL Server is very slow in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378066#M139877</link>
    <description>&lt;P&gt;there is a lot of parameters wich can influence your job performance, it could be great to have an idea of what you do in your tMap, also the configuration of your tDBoutput, it will better if you can send pictures of your job&lt;/P&gt;&lt;P&gt;also 4gb ram is too small for Talend project you should consider to user 8GB minimum&lt;/P&gt;&lt;P&gt;try use JTDS instead of JDBC, much faster&lt;/P&gt;</description>
    <pubDate>Fri, 11 Feb 2022 15:31:58 GMT</pubDate>
    <dc:creator>gjeremy1617088143</dc:creator>
    <dc:date>2022-02-11T15:31:58Z</dc:date>
    <item>
      <title>Talend job to Insert 3 million data from positional file to staging table in SQL Server is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378061#M139872</link>
      <description>&lt;P&gt;My requirement is to load data from positional file to staging table(truncate and insert). Positional file resides in file server and SQL server installed on DB server and Talend job is running on file server. So, here Bulk insert can't be used.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have created the job in talend and its inserting the data correctly, but the issue with this process is that it's very slow. Its processing 100-200 rows/s. I tried different ways to improve the performance but had no luck.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Here are the process and system details&lt;/P&gt;&lt;P&gt;System RAM:4GB/2 processors(Intel Xenon)&lt;/P&gt;&lt;P&gt;OS: Windows sever 2012 R2/Windows 7&lt;/P&gt;&lt;P&gt;Talend Data Management Version: 7.3.1&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Approaches I tried&lt;/P&gt;&lt;P&gt;1. used tFileInputPositional(To load data from positional file)-&amp;gt;tmap(no joins in tmap)-&amp;gt;tDBOutput and JDBC connection&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;updated the parameters -Xms256M and -Xmx1536M&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;Used parallelization and created 2 threads from tFileInputPositional-&amp;gt;tmap and then departitioned before tDBOutput. Even after this, it didn't improve the performance&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Currently Talend job is running for more than one and half hours to load 3 million data to SQL server staging table. I'm able to insert same 3 million records to staging table in 10 mins using SSIS. How can i improve the talend job performance and reduce the data processing time to 10 mins like in SSIS?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Please let me know if anyone has any suggestions.&lt;/P&gt;&lt;P&gt;Thanks in advance!&lt;/P&gt;</description>
      <pubDate>Fri, 15 Nov 2024 23:13:31 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378061#M139872</guid>
      <dc:creator>Yose</dc:creator>
      <dc:date>2024-11-15T23:13:31Z</dc:date>
    </item>
    <item>
      <title>Re: Talend job to Insert 3 million data from positional file to staging table in SQL Server is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378062#M139873</link>
      <description>&lt;P&gt;Hi, maybe you can send the result of the tmap in a hash, then read the hash to insert data in TDBOutput, also check that your tDBOutput is configured only on insert and not update or insert.&lt;/P&gt;&lt;P&gt;Send me love and kudos&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 08:01:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378062#M139873</guid>
      <dc:creator>gjeremy1617088143</dc:creator>
      <dc:date>2022-02-11T08:01:39Z</dc:date>
    </item>
    <item>
      <title>Re: Talend job to Insert 3 million data from positional file to staging table in SQL Server is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378063#M139874</link>
      <description>&lt;P&gt;Hi Jeremy, &lt;/P&gt;&lt;P&gt;I tried what you've suggested above. Now I'm getting memory issue in tHashOutput component. Error message is "the paging file too small for this operation to complete".&lt;/P&gt;&lt;P&gt;Could you please tell me how to resolve this memory issue tHashOutput component?&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 13:57:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378063#M139874</guid>
      <dc:creator>Yose</dc:creator>
      <dc:date>2022-02-11T13:57:57Z</dc:date>
    </item>
    <item>
      <title>Re: Talend job to Insert 3 million data from positional file to staging table in SQL Server is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378064#M139875</link>
      <description>&lt;P&gt;else if thashoutput don't solve the problem, you could try to write data in a tempory folder, it's in basic settings of the tMap component&lt;/P&gt;&lt;P&gt;https://help.talend.com/r/HqPypUO_cLctKv_dAWcgFw/GKM0cFBcfseClZruxOzlTw&lt;/P&gt;&lt;P&gt;you could also increase the commit size in the tDbOutput component&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 15:21:20 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378064#M139875</guid>
      <dc:creator>gjeremy1617088143</dc:creator>
      <dc:date>2022-02-11T15:21:20Z</dc:date>
    </item>
    <item>
      <title>Re: Talend job to Insert 3 million data from positional file to staging table in SQL Server is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378065#M139876</link>
      <description>&lt;P&gt;Even if I write data in temp folder, will this improve the job performance?&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 15:28:28 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378065#M139876</guid>
      <dc:creator>Yose</dc:creator>
      <dc:date>2022-02-11T15:28:28Z</dc:date>
    </item>
    <item>
      <title>Re: Talend job to Insert 3 million data from positional file to staging table in SQL Server is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378066#M139877</link>
      <description>&lt;P&gt;there is a lot of parameters wich can influence your job performance, it could be great to have an idea of what you do in your tMap, also the configuration of your tDBoutput, it will better if you can send pictures of your job&lt;/P&gt;&lt;P&gt;also 4gb ram is too small for Talend project you should consider to user 8GB minimum&lt;/P&gt;&lt;P&gt;try use JTDS instead of JDBC, much faster&lt;/P&gt;</description>
      <pubDate>Fri, 11 Feb 2022 15:31:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-job-to-Insert-3-million-data-from-positional-file-to/m-p/2378066#M139877</guid>
      <dc:creator>gjeremy1617088143</dc:creator>
      <dc:date>2022-02-11T15:31:58Z</dc:date>
    </item>
  </channel>
</rss>

