<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic ETL is slow between Oracle to Vertica in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/ETL-is-slow-between-Oracle-to-Vertica/m-p/2203573#M4824</link>
    <description>&lt;P&gt;Hi Everyone,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;I have an Oracle to Vertica job built in the Talend open source version. &amp;nbsp;I just have a tOracleInput --&amp;gt; tVerticaOutput. On the tOracleInput, I have the "use Cursor" checked with cursor size at 500,000. On the tVerticaOutput, I have the "use batchmode" selected and set to commit every 500,000 rows. Loading two million rows from Oracle to Vertica in 3 to 4 minutes. I think that is really long. If I dump the data from oracle into a csv file and then upload that file to vertica, it takes seconds. Does anyone have experience with this? Am I missing some option I'm not aware of? &amp;nbsp;Or is there a difference between Talend Open Source vs Talend Open Studios that would make it faster? &amp;nbsp;Thanks in advance.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 19 May 2017 16:29:53 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2017-05-19T16:29:53Z</dc:date>
    <item>
      <title>ETL is slow between Oracle to Vertica</title>
      <link>https://community.qlik.com/t5/Talend-Studio/ETL-is-slow-between-Oracle-to-Vertica/m-p/2203573#M4824</link>
      <description>&lt;P&gt;Hi Everyone,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;I have an Oracle to Vertica job built in the Talend open source version. &amp;nbsp;I just have a tOracleInput --&amp;gt; tVerticaOutput. On the tOracleInput, I have the "use Cursor" checked with cursor size at 500,000. On the tVerticaOutput, I have the "use batchmode" selected and set to commit every 500,000 rows. Loading two million rows from Oracle to Vertica in 3 to 4 minutes. I think that is really long. If I dump the data from oracle into a csv file and then upload that file to vertica, it takes seconds. Does anyone have experience with this? Am I missing some option I'm not aware of? &amp;nbsp;Or is there a difference between Talend Open Source vs Talend Open Studios that would make it faster? &amp;nbsp;Thanks in advance.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 19 May 2017 16:29:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/ETL-is-slow-between-Oracle-to-Vertica/m-p/2203573#M4824</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-05-19T16:29:53Z</dc:date>
    </item>
    <item>
      <title>Re: ETL is slow between Oracle to Vertica</title>
      <link>https://community.qlik.com/t5/Talend-Studio/ETL-is-slow-between-Oracle-to-Vertica/m-p/2203574#M4825</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Well, to&amp;nbsp;confirm which part is slow (Oracle or Vertica), you can replace the&amp;nbsp;&lt;SPAN&gt;tVerticaOutput by a simple tFileOutputDelimited.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Now, regarding the tVerticaOutput component, you have to&amp;nbsp;change also the batchsize.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;But you may also change to use&amp;nbsp;tVerticaOutputBulkExec which is (as all t&amp;lt;DB name&amp;gt;OutputBulkExec components) dedicated to mass transfer.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Hope this helps.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 19 May 2017 17:27:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/ETL-is-slow-between-Oracle-to-Vertica/m-p/2203574#M4825</guid>
      <dc:creator>TRF</dc:creator>
      <dc:date>2017-05-19T17:27:16Z</dc:date>
    </item>
    <item>
      <title>Re: ETL is slow between Oracle to Vertica</title>
      <link>https://community.qlik.com/t5/Talend-Studio/ETL-is-slow-between-Oracle-to-Vertica/m-p/2203575#M4826</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; Sorry for the delayed response. So I tried your solution and did notice an uptick in speed by a couple of seconds. So what I tested was to output oracle data to a delimited file and then use tVerticaOutputBulkexec to upload the file. This method was faster for large tables but for smaller tables, it didn't seem to matter.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;That said, i was trying to keep everything to be database to database instead of having to generate a file and then load that.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In our vertica server, we installed a sqlplus add on that can pull data from oracle to vertica much faster than what talend can do. I was just curious if i was missing anything when i setup the job. Thanks&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 24 May 2017 22:53:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/ETL-is-slow-between-Oracle-to-Vertica/m-p/2203575#M4826</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-05-24T22:53:48Z</dc:date>
    </item>
    <item>
      <title>Re: ETL is slow between Oracle to Vertica</title>
      <link>https://community.qlik.com/t5/Talend-Studio/ETL-is-slow-between-Oracle-to-Vertica/m-p/2203576#M4827</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;
&lt;P&gt;I agree on this solution. It works well. Normally my environment row processing around ~100K rows/sec.&lt;/P&gt;
&lt;P&gt;Oracle --&amp;gt; tMap --&amp;gt; Vertica(outputBulkExecute)&lt;/P&gt;
&lt;P&gt;cursor=500K &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; staging with outfile, rejfile, and excfile.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Currently I'm looking for it possible to do partially&amp;nbsp;commit using outputBulkExectue of Vertica.&lt;/P&gt;
&lt;P&gt;Please advise.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 25 Feb 2018 05:25:02 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/ETL-is-slow-between-Oracle-to-Vertica/m-p/2203576#M4827</guid>
      <dc:creator>DriscollBunt</dc:creator>
      <dc:date>2018-02-25T05:25:02Z</dc:date>
    </item>
  </channel>
</rss>

