<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Inserting data to redshift using tredshiftoutputbulkexec --&amp;gt;slow loading in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Inserting-data-to-redshift-using-tredshiftoutputbulkexec-gt-slow/m-p/2240789#M28140</link>
    <description>&lt;P&gt;Is it necessary to drop the table and recreate each time? Admittedly that shouldn't make the query run at 12 minutes.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The only other thing i have in my redshift setup is that in advanced i tick the &lt;STRONG&gt;Compressed by - gzip&lt;/STRONG&gt; checkbox. You could log into AWS cloudwatch and view the queries that are being executed as the job runs, this might give you an idea if the problem is on the redshift side or not.&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 07 Jan 2020 09:39:00 GMT</pubDate>
    <dc:creator>MattE</dc:creator>
    <dc:date>2020-01-07T09:39:00Z</dc:date>
    <item>
      <title>Inserting data to redshift using tredshiftoutputbulkexec --&gt;slow loading</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Inserting-data-to-redshift-using-tredshiftoutputbulkexec-gt-slow/m-p/2240787#M28138</link>
      <description>&lt;P&gt;Initially I tried&amp;nbsp;tmongodbinput ==&amp;gt; tredshiftoutput&amp;nbsp; the data loaded succesfully, but it took me about 460 rows/second it takes too long time(12 mins). To improve the performance tried&amp;nbsp;tredshiftoutputbulkexec component.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Now trying to load my mongodb data into the redshift (3 lakhs data). Going from (tmongodbinput) directly to (tredshiftoutputbulkexec)&amp;nbsp; got me about same time approx 450 rows/second(12-13 mins). tredshiftoutputbulkexec also takes same time to load data. how to improve the performance?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;STRONG&gt;Reading the data is faster in tredshiftoutputbulkexec but commits takes same as tredshiftoutput.&lt;/STRONG&gt;&lt;/P&gt; 
&lt;P&gt;I attached the screen shot below.&lt;/P&gt; 
&lt;P&gt;-----------------------------------------------&lt;/P&gt; 
&lt;P&gt;Can anyone please help me with the solution.&lt;/P&gt; 
&lt;P&gt;Please do let me know if you need any more details.&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="publish.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M9HN.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/145026i11F7CDF74D0CF95B/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M9HN.png" alt="0683p000009M9HN.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 03:39:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Inserting-data-to-redshift-using-tredshiftoutputbulkexec-gt-slow/m-p/2240787#M28138</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T03:39:50Z</dc:date>
    </item>
    <item>
      <title>Re: Inserting data to redshift using tredshiftoutputbulkexec --&gt;slow loading</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Inserting-data-to-redshift-using-tredshiftoutputbulkexec-gt-slow/m-p/2240788#M28139</link>
      <description>mongo and redshift data… are they on hosted machines… how is the network between them. if there is no speed variations.. the trouble may be on server speed….
&lt;BR /&gt;</description>
      <pubDate>Tue, 07 Jan 2020 09:26:10 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Inserting-data-to-redshift-using-tredshiftoutputbulkexec-gt-slow/m-p/2240788#M28139</guid>
      <dc:creator>fdenis</dc:creator>
      <dc:date>2020-01-07T09:26:10Z</dc:date>
    </item>
    <item>
      <title>Re: Inserting data to redshift using tredshiftoutputbulkexec --&gt;slow loading</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Inserting-data-to-redshift-using-tredshiftoutputbulkexec-gt-slow/m-p/2240789#M28140</link>
      <description>&lt;P&gt;Is it necessary to drop the table and recreate each time? Admittedly that shouldn't make the query run at 12 minutes.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The only other thing i have in my redshift setup is that in advanced i tick the &lt;STRONG&gt;Compressed by - gzip&lt;/STRONG&gt; checkbox. You could log into AWS cloudwatch and view the queries that are being executed as the job runs, this might give you an idea if the problem is on the redshift side or not.&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 07 Jan 2020 09:39:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Inserting-data-to-redshift-using-tredshiftoutputbulkexec-gt-slow/m-p/2240789#M28140</guid>
      <dc:creator>MattE</dc:creator>
      <dc:date>2020-01-07T09:39:00Z</dc:date>
    </item>
  </channel>
</rss>

