<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Loading data to Amazon Redshift is very slow in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-to-Amazon-Redshift-is-very-slow/m-p/2237890#M26176</link>
    <description>I am evaluating Talenb and Amazon Redshift for big data solution. Unfortunately, it looks like Talend is unable to load data in reasonable rate (around 100 Row/Sec when doing insert only). I don't want to use another 3rd part tool just for the loading, and I don't want to use files and S3 and with COPY as well.&lt;BR /&gt;Any suggestion on how to optimized Talned job to boost performance?&lt;BR /&gt;Any future plans to improve that component?&lt;BR /&gt;Thanks!&lt;BR /&gt;Ophir</description>
    <pubDate>Sat, 16 Nov 2024 11:45:27 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T11:45:27Z</dc:date>
    <item>
      <title>Loading data to Amazon Redshift is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-to-Amazon-Redshift-is-very-slow/m-p/2237890#M26176</link>
      <description>I am evaluating Talenb and Amazon Redshift for big data solution. Unfortunately, it looks like Talend is unable to load data in reasonable rate (around 100 Row/Sec when doing insert only). I don't want to use another 3rd part tool just for the loading, and I don't want to use files and S3 and with COPY as well.&lt;BR /&gt;Any suggestion on how to optimized Talned job to boost performance?&lt;BR /&gt;Any future plans to improve that component?&lt;BR /&gt;Thanks!&lt;BR /&gt;Ophir</description>
      <pubDate>Sat, 16 Nov 2024 11:45:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-to-Amazon-Redshift-is-very-slow/m-p/2237890#M26176</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T11:45:27Z</dc:date>
    </item>
    <item>
      <title>Re: Loading data to Amazon Redshift is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-to-Amazon-Redshift-is-very-slow/m-p/2237891#M26177</link>
      <description>Hi, 
&lt;BR /&gt;Performance issue is usually caused by the DB connection or the job design, could you please post your job screenshots into forum so that we can address this issue quickly.
&lt;BR /&gt;Best regards
&lt;BR /&gt;Sabrina</description>
      <pubDate>Wed, 12 Feb 2014 06:36:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-to-Amazon-Redshift-is-very-slow/m-p/2237891#M26177</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-02-12T06:36:46Z</dc:date>
    </item>
    <item>
      <title>Re: Loading data to Amazon Redshift is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-to-Amazon-Redshift-is-very-slow/m-p/2237892#M26178</link>
      <description>Hi Sabrina, 
&lt;BR /&gt;I found the problem.. there are 2 bottleneck in the process, first one is network, once I run the process on a server with high network connection it solve part of the problem. The second issue is the update else insert mode. Once I change it to insert only, I could run the loading in a batch mode. (Put 500 rows per insert work like magic). The data is loaded into a temp tables in Redshift and the update else insert operation is done by a join from the temp table to the target table as proposed by Amazon (Amazon best practice for upsert operation) 
&lt;BR /&gt;In that way I manage to insert 1000 Rows per second, the update else insert is very fast when done on Redshift db, so this design works for me. 
&lt;BR /&gt;I wonder if Talend could create a better upsert mechanism in the future that could use batch mode like the insert. 
&lt;BR /&gt;Thanks ! 
&lt;BR /&gt;Ophir</description>
      <pubDate>Wed, 19 Feb 2014 08:08:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-to-Amazon-Redshift-is-very-slow/m-p/2237892#M26178</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-02-19T08:08:51Z</dc:date>
    </item>
    <item>
      <title>Re: Loading data to Amazon Redshift is very slow</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-data-to-Amazon-Redshift-is-very-slow/m-p/2237893#M26179</link>
      <description>Hi, 
&lt;BR /&gt;It welcome to open a jira issue for your requirement of DI project on 
&lt;A href="https://jira.talendforge.org/secure/Dashboard.jspa" target="_blank" rel="nofollow noopener noreferrer"&gt;Talend Bug Tracker&lt;/A&gt;.
&lt;BR /&gt;Best regards
&lt;BR /&gt;Sabrina</description>
      <pubDate>Wed, 19 Feb 2014 08:22:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-data-to-Amazon-Redshift-is-very-slow/m-p/2237893#M26179</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-02-19T08:22:13Z</dc:date>
    </item>
  </channel>
</rss>

