<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Bigdata -Vertica integration.[HDFS to Vertica] in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321802#M91800</link>
    <description>Hi Team,&lt;BR /&gt;I have a scenario where have to create a job to extract files from FTP(zip files .csv.tar.gz) and load to Hive tables.&lt;BR /&gt;Some logics also to add in this flow. could u please suggest a best flow to create such a talend BigData job.&lt;BR /&gt;&lt;B&gt;FTPconnection&lt;/B&gt;-&amp;gt;(on subjob ok)--&amp;gt;&lt;B&gt;tfilelist&lt;/B&gt;--&amp;gt;(iterate)--&amp;gt;&lt;B&gt;tUnarchive(ftp path)&lt;/B&gt;&lt;BR /&gt;&lt;B&gt;FTPlist&lt;/B&gt;--&amp;gt;(iterate)--&amp;gt;&lt;B&gt;tHDFSput&lt;/B&gt;--&amp;gt;(iterate)--&amp;gt;&lt;B&gt;tHiveLoad.&lt;/B&gt;&lt;BR /&gt;also is it necessary to give Hadoop properties everytime while establishing a HDFS connection. What is the use of it.?</description>
    <pubDate>Tue, 28 Mar 2017 05:57:14 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2017-03-28T05:57:14Z</dc:date>
    <item>
      <title>Bigdata -Vertica integration.[HDFS to Vertica]</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321799#M91797</link>
      <description>I have a below use case and need suggestion/optimized design ;&lt;BR /&gt;Have a source file 30-50GB size available in HDFS and i want to load this files in to Vertica. what is the best way to load the data from HDFS to Vertica ?&lt;BR /&gt;Tried using thiveinput-tmap-tverticaouput .. its very slow and throughout is 12Row/sec.&lt;BR /&gt;Tried TELTHive andTELTvertica - its not loading the data and no errors . [ but ELT will work with same DBtype and but when i see a option in talend to connect the components i tried it. but its not working&lt;BR /&gt;waiting for your suggestion/best approach.</description>
      <pubDate>Sat, 16 Nov 2024 09:58:32 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321799#M91797</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T09:58:32Z</dc:date>
    </item>
    <item>
      <title>Re: Bigdata -Vertica integration.[HDFS to Vertica]</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321800#M91798</link>
      <description>Hi,
&lt;BR /&gt;You can use component&amp;nbsp;
&lt;A href="https://help.talend.com/search/all?query=tSqoopExport&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;TalendHelpCenter:tSqoopExport&lt;/A&gt;&amp;nbsp;to&amp;nbsp;
&lt;FONT size="2"&gt;&lt;FONT face="noto, Helvetica, Arial, sans-serif"&gt;call sqoop to transfer data from the Hadoop Distributed File System (HDFS) to a relational database management system (RDBMS).&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="2"&gt;&lt;FONT face="noto, Helvetica, Arial, sans-serif"&gt;For more information, please refer to the component reference.&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="2"&gt;&lt;FONT face="noto, Helvetica, Arial, sans-serif"&gt;Best regards&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="2"&gt;&lt;FONT face="noto, Helvetica, Arial, sans-serif"&gt;Sabrina&lt;/FONT&gt;&lt;/FONT&gt;</description>
      <pubDate>Thu, 23 Mar 2017 07:27:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321800#M91798</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-03-23T07:27:54Z</dc:date>
    </item>
    <item>
      <title>Re: Bigdata -Vertica integration.[HDFS to Vertica]</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321801#M91799</link>
      <description>Hi Sabrina,&lt;BR /&gt;Yes, I have tried that option. but I got the below error;&lt;BR /&gt;ERROR]: org.apache.sqoop.tool.BaseSqoopTool - Got error creating database manager: java.io.IOException: No manager for connect string: jdbc:vertica://&lt;S&gt;&lt;B&gt;&lt;I&gt;host&lt;/I&gt;&lt;/B&gt;&lt;/S&gt;:5433/&lt;S&gt;&lt;B&gt;&lt;I&gt;dbname&lt;/I&gt;&lt;/B&gt;&lt;/S&gt;&lt;BR /&gt;Note :i have hided host and dbname here.&lt;BR /&gt;For Vertica DB I&amp;nbsp;don't see any JDBC option in metadata , hence I used built-in option and I placed the JDBC value.</description>
      <pubDate>Tue, 28 Mar 2017 05:42:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321801#M91799</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-03-28T05:42:33Z</dc:date>
    </item>
    <item>
      <title>Re: Bigdata -Vertica integration.[HDFS to Vertica]</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321802#M91800</link>
      <description>Hi Team,&lt;BR /&gt;I have a scenario where have to create a job to extract files from FTP(zip files .csv.tar.gz) and load to Hive tables.&lt;BR /&gt;Some logics also to add in this flow. could u please suggest a best flow to create such a talend BigData job.&lt;BR /&gt;&lt;B&gt;FTPconnection&lt;/B&gt;-&amp;gt;(on subjob ok)--&amp;gt;&lt;B&gt;tfilelist&lt;/B&gt;--&amp;gt;(iterate)--&amp;gt;&lt;B&gt;tUnarchive(ftp path)&lt;/B&gt;&lt;BR /&gt;&lt;B&gt;FTPlist&lt;/B&gt;--&amp;gt;(iterate)--&amp;gt;&lt;B&gt;tHDFSput&lt;/B&gt;--&amp;gt;(iterate)--&amp;gt;&lt;B&gt;tHiveLoad.&lt;/B&gt;&lt;BR /&gt;also is it necessary to give Hadoop properties everytime while establishing a HDFS connection. What is the use of it.?</description>
      <pubDate>Tue, 28 Mar 2017 05:57:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321802#M91800</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-03-28T05:57:14Z</dc:date>
    </item>
    <item>
      <title>Re: Bigdata -Vertica integration.[HDFS to Vertica]</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321803#M91801</link>
      <description>Hi,
&lt;BR /&gt;So far, talend don't support for transferring data by air.
&lt;BR /&gt;From your job reqirement, you have to get your files from FTP(zip files .csv.tar.gz)into local firstly and then put them into hive table.&amp;nbsp;
&lt;BR /&gt;
&lt;FONT size="1"&gt;&lt;FONT face="Verdana, sans-serif"&gt;You can use the tHiveCreateTable to create a table within Hive if the table doesn't exist yet and then use&amp;nbsp;the tHiveLoad to load the local delimited file into your Hive table.&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="1"&gt;&lt;FONT face="Verdana, sans-serif"&gt;Best regards&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="1"&gt;&lt;FONT face="Verdana, sans-serif"&gt;Sabrina&lt;BR /&gt;&lt;/FONT&gt;&lt;/FONT&gt;</description>
      <pubDate>Tue, 28 Mar 2017 13:49:10 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321803#M91801</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-03-28T13:49:10Z</dc:date>
    </item>
    <item>
      <title>Re: Bigdata -Vertica integration.[HDFS to Vertica]</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321804#M91802</link>
      <description>Hi&amp;nbsp;&lt;FONT size="1"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;Sabrina and Sam,&lt;/FONT&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT size="1"&gt;&lt;FONT face="Verdana, Helvetica, Arial, sans-serif"&gt;for this same scenario, i designed the job as below. &amp;nbsp; &amp;nbsp;the main reason is the input file is really big 300GB to 500GB. &amp;nbsp;So i don't want to download in local and do load in Hive.&lt;/FONT&gt;&lt;/FONT&gt;&lt;BR /&gt;tFTPList--&amp;gt;Tssh &amp;nbsp; TFTP list is to connect FTPserver &amp;nbsp;and use TSSH to execute the HDFS commands to place the file in HDFS&amp;nbsp;&lt;BR /&gt;once the file is available , used hive create table and hive load connector to load the same data.&lt;BR /&gt;the performance also good .&lt;BR /&gt;Hope this will help Sam.&amp;nbsp;&lt;BR /&gt;For Connection &amp;nbsp;i always use Connection component (eg hive connection or VerticaConnector) once and i reuse it for all other components. once of the best practice from Talend community.&amp;nbsp;</description>
      <pubDate>Tue, 28 Mar 2017 17:50:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Bigdata-Vertica-integration-HDFS-to-Vertica/m-p/2321804#M91802</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-03-28T17:50:50Z</dc:date>
    </item>
  </channel>
</rss>

