<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Synchronize files from unix to hdfs in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333805#M102550</link>
    <description>Hi&amp;nbsp; 
&lt;BR /&gt;tUnite component is needed in this job to merge all the file name before doing the join.&amp;nbsp; 
&lt;BR /&gt; 
&lt;PRE&gt;After this am I doing an FTPGet getting the file locally and then doing an HDFSPut? Is there anyway to orchestrate sending the file directly from the remote Unix server to HDFS?&lt;/PRE&gt; 
&lt;BR /&gt;No a direct way to move the file between remote server and HDFS, you have to get it to local system and then put it to HDFS. 
&lt;BR /&gt;Regards 
&lt;BR /&gt;Shong</description>
    <pubDate>Fri, 15 Apr 2016 05:06:05 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2016-04-15T05:06:05Z</dc:date>
    <item>
      <title>Synchronize files from unix to hdfs</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333800#M102545</link>
      <description>I want to try and setup a job that would get a list of files from a remote server (Unix) and compare it with files listed in HDFS. If the file does not exist I want to get the file from the Unix server and put it into HDFS. Can anyone point me in the right starting direction on how I might be able to do this? I am using the latest 6.1 version of the Talend Big Data Studio.</description>
      <pubDate>Sat, 16 Nov 2024 10:42:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333800#M102545</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2024-11-16T10:42:01Z</dc:date>
    </item>
    <item>
      <title>Re: Synchronize files from unix to hdfs</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333801#M102546</link>
      <description>Hi&amp;nbsp; 
&lt;BR /&gt;What protocols do you want to access the remote server (Unix) and get the file? FTP? SCP or http? And do you just want to compare the file name or the file content? 
&lt;BR /&gt;Regards 
&lt;BR /&gt;Shong</description>
      <pubDate>Wed, 13 Apr 2016 03:09:55 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333801#M102546</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-13T03:09:55Z</dc:date>
    </item>
    <item>
      <title>Re: Synchronize files from unix to hdfs</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333802#M102547</link>
      <description>I have access to scp and ftp. I just want to compare the file names since they are unique per day.</description>
      <pubDate>Wed, 13 Apr 2016 13:03:19 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333802#M102547</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2016-04-13T13:03:19Z</dc:date>
    </item>
    <item>
      <title>Re: Synchronize files from unix to hdfs</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333803#M102548</link>
      <description>Hi 
&lt;BR /&gt;You can use txxxList to get all the file names from remote server and HDFS server, do an inner join between remote files and HDFS files and get the unmatched records, eg: 
&lt;BR /&gt;tFTPFileList--iterate--tFixedFlowInput1--main--tUnite--main--tMap--out1--&amp;gt; 
&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; | 
&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;lookup 
&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; | 
&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; tHDFS--iterate--tFixedFlowinput2--main--tUnite- 
&lt;BR /&gt;tFixedFlowInput1: define one column and set its value as: 
&lt;BR /&gt;((String)globalMap.get("tFTPFileList_1_CURRENT_FILE")) 
&lt;BR /&gt;tFixedFlowInput2: define one column and set its value as: 
&lt;BR /&gt;((String)globalMap.get("tHDFSList_1_CURRENT_FILE")) 
&lt;BR /&gt;Refer to this KB article: 
&lt;BR /&gt; 
&lt;A href="https://help.talend.com/pages/viewpage.action?pageId=190513450" rel="nofollow noopener noreferrer"&gt;https://help.talend.com/pages/viewpage.action?pageId=190513450&lt;/A&gt; 
&lt;BR /&gt;Regards 
&lt;BR /&gt;Shong</description>
      <pubDate>Thu, 14 Apr 2016 04:09:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333803#M102548</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-14T04:09:12Z</dc:date>
    </item>
    <item>
      <title>Re: Synchronize files from unix to hdfs</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333804#M102549</link>
      <description>Thank you for that! 
&lt;BR /&gt;I created the flow as you suggested. In this I am not sure the need for the unite element but I have them there anyways. 
&lt;BR /&gt; 
&lt;BR /&gt;After this am I doing an FTPGet getting the file locally and then doing an HDFSPut? Is there anyway to orchestrate sending the file directly from the remote Unix server to HDFS?</description>
      <pubDate>Thu, 14 Apr 2016 17:48:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333804#M102549</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-14T17:48:37Z</dc:date>
    </item>
    <item>
      <title>Re: Synchronize files from unix to hdfs</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333805#M102550</link>
      <description>Hi&amp;nbsp; 
&lt;BR /&gt;tUnite component is needed in this job to merge all the file name before doing the join.&amp;nbsp; 
&lt;BR /&gt; 
&lt;PRE&gt;After this am I doing an FTPGet getting the file locally and then doing an HDFSPut? Is there anyway to orchestrate sending the file directly from the remote Unix server to HDFS?&lt;/PRE&gt; 
&lt;BR /&gt;No a direct way to move the file between remote server and HDFS, you have to get it to local system and then put it to HDFS. 
&lt;BR /&gt;Regards 
&lt;BR /&gt;Shong</description>
      <pubDate>Fri, 15 Apr 2016 05:06:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333805#M102550</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-15T05:06:05Z</dc:date>
    </item>
    <item>
      <title>Re: Synchronize files from unix to hdfs</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333806#M102551</link>
      <description>Thanks for all of your help!</description>
      <pubDate>Fri, 15 Apr 2016 21:00:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Synchronize-files-from-unix-to-hdfs/m-p/2333806#M102551</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-15T21:00:24Z</dc:date>
    </item>
  </channel>
</rss>

