<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Trouble with tFileOutputDelimited in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343067#M110866</link>
    <description>tFTPDelete delete ONLY the files listed in tFTPList or delete ALL files in directory? 
&lt;BR /&gt;I ask because the source folder is extremely dynamic and while download there is another job that continuosly upload to it then I need to be sure that deletes only the files already downloaded. 
&lt;BR /&gt;Your job job wouldn't be like follow? 
&lt;BR /&gt;[size=2][font=consolas, monaco, "bitstream vera sans mono", "courier new", courier, monospace]tFTPConnection -&amp;gt; [tFTPList -&amp;gt; (iterate) -&amp;gt; tFTPGet -&amp;gt; [/font][/size] 
&lt;FONT size="2"&gt;&lt;FONT face="consolas," monaco=""&gt;(iterate) -&amp;gt;&lt;/FONT&gt;&lt;/FONT&gt;[size=2][font=consolas, monaco, "bitstream vera sans mono", "courier new", courier, monospace] tFTPDelete][/font][/size] 
&lt;BR /&gt;TKS</description>
    <pubDate>Tue, 07 Mar 2017 08:32:46 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2017-03-07T08:32:46Z</dc:date>
    <item>
      <title>Trouble with tFileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343065#M110864</link>
      <description>&lt;P&gt;Hello.&lt;BR /&gt;- SCENARIO&lt;BR /&gt;[list=*]&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;I need to download files from an ftp server to local folder and then I need to delete theme&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;In the source ftp folder there is a continuosly uploading of files so I can't only do a "delete all files" job&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;I would solve in this way:&lt;BR /&gt;tFTPConnection -&amp;gt; tFTPFileList -&amp;gt; tFTPFileProperties -&amp;gt; tFTPOutputDelimited&lt;BR /&gt;&lt;BR /&gt;In this way I do:&lt;BR /&gt;1) open the connection (OK)&lt;BR /&gt;2) read all the files name in the directory (OK)&lt;BR /&gt;3) take all the details from the files (OK)&lt;BR /&gt;4) write the list of files in cvs so with the next job I can download and delete without delete any new file (WRONG)&lt;BR /&gt;&lt;BR /&gt;Everything work right: files' name are read, properties are fetched, csv is generated but inside there are only 3 blank rows without data (ftp files are 3).&lt;BR /&gt;&lt;BR /&gt;How I can solve? Probably I need to create a SCHEMA to map tFTPFileProperties to something but I don't know what and where.&lt;BR /&gt;&lt;BR /&gt;Any solution?&lt;BR /&gt;&lt;BR /&gt;Thanks in advance&lt;BR /&gt;Tato&lt;/P&gt;</description>
      <pubDate>Mon, 06 Mar 2017 17:10:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343065#M110864</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-03-06T17:10:27Z</dc:date>
    </item>
    <item>
      <title>Re: Trouble with tFileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343066#M110865</link>
      <description>If You want go by Your way, You must convert tFTPList result into flow using 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;PRE&gt;&amp;nbsp;((String)globalMap.get("tFTPFileList_1_CURRENT_FILE"))&lt;/PRE&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;or 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;PRE&gt;&amp;nbsp;((String)globalMap.get("tFTPFileList_1_CURRENT_FILEPATH"))&lt;/PRE&gt; 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;BLOCKQUOTE&gt;
  tFTPConnection -&amp;gt; tFTPList -&amp;gt; (iterate) -&amp;gt; convert variable to flow (tMap, tFixedFlowInput, tJavaFlex) -&amp;gt; tFileOutputDelimited 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;but really You can do all at same step 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;PRE&gt;tFTPConnection -&amp;gt; tFTPList -&amp;gt; (iterate) -&amp;gt; tFTPGet - tFTPDelete&lt;BR /&gt;for&amp;nbsp;&amp;nbsp;tFTPGet - tFTPDelete use&amp;nbsp;&lt;FONT face="consolas," monaco=""&gt;&amp;nbsp;((String)globalMap.get("tFTPFileList_1_CURRENT_FILEPATH")) as filename&lt;/FONT&gt;&lt;BR /&gt;&lt;/PRE&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;it is must work as expected</description>
      <pubDate>Mon, 06 Mar 2017 20:08:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343066#M110865</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2017-03-06T20:08:01Z</dc:date>
    </item>
    <item>
      <title>Re: Trouble with tFileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343067#M110866</link>
      <description>tFTPDelete delete ONLY the files listed in tFTPList or delete ALL files in directory? 
&lt;BR /&gt;I ask because the source folder is extremely dynamic and while download there is another job that continuosly upload to it then I need to be sure that deletes only the files already downloaded. 
&lt;BR /&gt;Your job job wouldn't be like follow? 
&lt;BR /&gt;[size=2][font=consolas, monaco, "bitstream vera sans mono", "courier new", courier, monospace]tFTPConnection -&amp;gt; [tFTPList -&amp;gt; (iterate) -&amp;gt; tFTPGet -&amp;gt; [/font][/size] 
&lt;FONT size="2"&gt;&lt;FONT face="consolas," monaco=""&gt;(iterate) -&amp;gt;&lt;/FONT&gt;&lt;/FONT&gt;[size=2][font=consolas, monaco, "bitstream vera sans mono", "courier new", courier, monospace] tFTPDelete][/font][/size] 
&lt;BR /&gt;TKS</description>
      <pubDate>Tue, 07 Mar 2017 08:32:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343067#M110866</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-03-07T08:32:46Z</dc:date>
    </item>
    <item>
      <title>Re: Trouble with tFileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343068#M110867</link>
      <description>tFTPDelete - delete only file which You set 
&lt;BR /&gt;so it must not be collisions</description>
      <pubDate>Tue, 07 Mar 2017 08:56:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343068#M110867</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2017-03-07T08:56:57Z</dc:date>
    </item>
    <item>
      <title>Re: Trouble with tFileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343069#M110868</link>
      <description>Sorry but I can't understand: where I can set the files to be deleted?
&lt;BR /&gt;My scenario:
&lt;BR /&gt;[list=1]
&lt;BR /&gt;
&lt;LI&gt;a job upload to the folder 100 files each second (usually about 1kb each but could be bigger)&lt;/LI&gt;
&lt;BR /&gt;
&lt;LI&gt;DownoladJob run every 10s&lt;/LI&gt;
&lt;BR /&gt;
&lt;LI&gt;When job starts (after first 10s) the tFTPList fetches the name of 1.000 files&lt;/LI&gt;
&lt;BR /&gt;
&lt;LI&gt;tFTPGet (linked to tFTPList by ITERATE) downloads all 1.000 files (I'm sure because Talend Studio show "1.000 execs finished" on the iterate link)&lt;/LI&gt;
&lt;BR /&gt;
&lt;LI&gt;I linked tFTPGet to tFTPDelete with ITERATE link but on it is shown "1 exec finished"?&lt;/LI&gt;
&lt;BR /&gt;
&lt;BR /&gt;If the job take 1s to run, in the beginning there are 1.000 files but when tFTPDelete run there will be 1.100 files
&lt;BR /&gt;How I can be sure that will be deleted ONLY the 1.000 that were fetched in the beginning (and already downloaded)?
&lt;BR /&gt;
&lt;BR /&gt;Tks</description>
      <pubDate>Tue, 07 Mar 2017 09:58:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343069#M110868</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-03-07T09:58:50Z</dc:date>
    </item>
    <item>
      <title>Re: Trouble with tFileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343070#M110869</link>
      <description>With my test tFTPDelete...I don't understand how it works! 
&lt;BR /&gt;Scenario: 
&lt;BR /&gt;[list=1] 
&lt;BR /&gt; 
&lt;LI&gt;Starting with 10 files already uploaded in source ftp folder&lt;/LI&gt; 
&lt;BR /&gt; 
&lt;LI&gt;Run DeleteJob while uploading 125 files of different size (from 2M to 400M)&lt;/LI&gt; 
&lt;BR /&gt; 
&lt;LI&gt;In the end of the job has been downloaded (and deleted) first 52 files (files were numbered, first 49 files were max 2M, 50th and 51st were 400M)&lt;/LI&gt; 
&lt;BR /&gt; 
&lt;LI&gt;In the end were upload 135 files&lt;/LI&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;Finally I think that with this component is not possible to know the exact status of the source when the job start to run. 
&lt;BR /&gt; 
&lt;BR /&gt;I will solve creating 2 jobs 
&lt;BR /&gt;[list=1] 
&lt;BR /&gt; 
&lt;LI&gt;fetch source folder writing the list of files in csv file&lt;/LI&gt; 
&lt;BR /&gt; 
&lt;LI&gt;read the csv file to know which files need to be downloaded&lt;/LI&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;Bye 
&lt;BR /&gt; 
&lt;BR /&gt;Bye</description>
      <pubDate>Wed, 08 Mar 2017 09:30:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343070#M110869</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-03-08T09:30:50Z</dc:date>
    </item>
    <item>
      <title>Re: Trouble with tFileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343071#M110870</link>
      <description>&lt;BLOCKQUOTE&gt;
  akatato wrote: 
 &lt;BR /&gt;Sorry but I can't understand: where I can set the files to be deleted? 
 &lt;BR /&gt;My scenario: 
 &lt;BR /&gt;[list=1] 
 &lt;BR /&gt; 
 &lt;LI&gt;a job upload to the folder 100 files each second (usually about 1kb each but could be bigger)&lt;/LI&gt; 
 &lt;BR /&gt; 
 &lt;LI&gt;DownoladJob run every 10s&lt;/LI&gt; 
 &lt;BR /&gt; 
 &lt;LI&gt;When job starts (after first 10s) the tFTPList fetches the name of 1.000 files&lt;/LI&gt; 
 &lt;BR /&gt; 
 &lt;LI&gt;tFTPGet (linked to tFTPList by ITERATE) downloads all 1.000 files (I'm sure because Talend Studio show "1.000 execs finished" on the iterate link)&lt;/LI&gt; 
 &lt;BR /&gt; 
 &lt;LI&gt;I linked tFTPGet to tFTPDelete with ITERATE link but on it is shown "1 exec finished"?&lt;/LI&gt; 
 &lt;BR /&gt; 
 &lt;BR /&gt;If the job take 1s to run, in the beginning there are 1.000 files but when tFTPDelete run there will be 1.100 files 
 &lt;BR /&gt;How I can be sure that will be deleted ONLY the 1.000 that were fetched in the beginning (and already downloaded)? 
 &lt;BR /&gt; 
 &lt;BR /&gt;Tks 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;how? why You not make a test if not trust nobody? 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;it easy - run the Job, and compare result! 
&lt;BR /&gt;tFTPDelete delete ONLY file with name which You are (I one more time tell You - YOU ARE &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; ) pass to him 
&lt;BR /&gt;so, no any problems how many files come per second, and how often run download Job 
&lt;BR /&gt; 
&lt;BR /&gt;[list=*] 
&lt;BR /&gt; 
&lt;LI&gt;tFTPList - is only one element who read file name&lt;/LI&gt; 
&lt;BR /&gt; 
&lt;LI&gt;tFTPGet - fetch filename received from tFTPList&lt;/LI&gt; 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;[list=*] 
&lt;BR /&gt; 
&lt;LI&gt;tFTPDelete - delete filename received from tFTPList&lt;/LI&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;and no matter - what happens - if tFTPList fetch new files - all will be downloaded and deleted, if tFTPList - not fetch file, not 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;Potential collision could be only when - file created, but other Job not finish write information 
&lt;BR /&gt;For avoid this - list files in date time order, oldest first. In this case - even if other Job will not finished write information when file name was fetched by tFTPList, it will be finished when all other files will be processed. 
&lt;BR /&gt; 
&lt;BR /&gt; 
&lt;BR /&gt;P.S. and if You seriously worry about collision (it could be of course) - change logic from Pull to Push. switch from ftp to message queue, it resolve this collisions more accurate</description>
      <pubDate>Wed, 08 Mar 2017 11:19:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Trouble-with-tFileOutputDelimited/m-p/2343071#M110870</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2017-03-08T11:19:33Z</dc:date>
    </item>
  </channel>
</rss>

