<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Retrieve the selected files from S3 bucket and process those in the job directly in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267227#M46186</link>
    <description>&lt;P&gt;My scenario is that source files arrive every day in different date-named folders in S3. I need to pick up the files newer than the last processed timestamp and use them in the main flow of the job. I am using the tS3List component to list the files by prefix (I can't give the complete path because the folder names contain dynamic dates). After that, in the tS3Get component, I need to get only the files that are newer than the last processed timestamp, but few options are available: I can only provide the tS3List current key in the Key field. With this I also get older, already processed files. Also, once I get the right file, I don't want to store it locally; I want to process it directly in the job. Please help me achieve this. Thanks!!&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 02:22:13 GMT</pubDate>
    <dc:creator>gopal16</dc:creator>
    <dc:date>2024-11-16T02:22:13Z</dc:date>
    <item>
      <title>Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267227#M46186</link>
      <description>&lt;P&gt;My scenario is that source files arrive every day in different date-named folders in S3. I need to pick up the files newer than the last processed timestamp and use them in the main flow of the job. I am using the tS3List component to list the files by prefix (I can't give the complete path because the folder names contain dynamic dates). After that, in the tS3Get component, I need to get only the files that are newer than the last processed timestamp, but few options are available: I can only provide the tS3List current key in the Key field. With this I also get older, already processed files. Also, once I get the right file, I don't want to store it locally; I want to process it directly in the job. Please help me achieve this. Thanks!!&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 02:22:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267227#M46186</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2024-11-16T02:22:13Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267228#M46187</link>
      <description>Hi 
&lt;BR /&gt;I have seen similar requirements raised by other users. Unfortunately, it is not possible to read the file directly on S3; you have to download the file to the local system, process it, and then delete the local copy afterwards if needed. 
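&lt;BR /&gt;A minimal sketch of that download, process, delete sequence, written in plain Python rather than as Talend components. The helpers fetch_to_local and process_file are hypothetical stand-ins for tS3Get and the processing subjob, and the key and file content are made up for illustration:

```python
import os
import tempfile


def fetch_to_local(key, local_path):
    # Hypothetical stand-in for the tS3Get download step: writes fake content.
    with open(local_path, "w", encoding="utf-8") as handle:
        handle.write("id;name\n1;alpha\n2;beta\n")


def process_file(local_path):
    # Stand-in for the processing subjob: count data rows after the header.
    with open(local_path, encoding="utf-8") as handle:
        return len(handle.read().splitlines()) - 1


def download_process_delete(key):
    # The temporary directory and everything in it is removed on exit,
    # so no local copy survives once processing is done.
    with tempfile.TemporaryDirectory() as workdir:
        local_path = os.path.join(workdir, os.path.basename(key))
        fetch_to_local(key, local_path)
        rows = process_file(local_path)
    return rows, os.path.exists(local_path)


rows, still_there = download_process_delete("incoming/2020-05-21/orders.txt")
```

&lt;BR /&gt;Here rows holds the number of data rows read and still_there reports whether the local copy survived the cleanup.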
&lt;BR /&gt; 
&lt;BR /&gt;Regards 
&lt;BR /&gt;Shong</description>
      <pubDate>Thu, 21 May 2020 10:25:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267228#M46187</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2020-05-21T10:25:15Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267229#M46188</link>
      <description>&lt;P&gt;Okay. What about picking up files with a particular prefix from the dynamic date folders in S3 instead?&lt;/P&gt;</description>
      <pubDate>Thu, 21 May 2020 10:27:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267229#M46188</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2020-05-21T10:27:07Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267230#M46189</link>
      <description>Hi
&lt;BR /&gt;Set the key prefix to the parent folder that contains the dynamic date folders. It will list all the sub-folders and files, and you can then filter the files on a condition, such as the file extension "*.txt".
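&lt;BR /&gt;A rough sketch of that filtering logic in plain Python (not Talend code), extended with the last-processed timestamp check from the original question. The listing, the keys, and the function name select_new_files are all assumptions made for illustration:

```python
from datetime import datetime, timezone

# Hypothetical listing: (key, last modified) pairs as a prefix listing might return them.
listing = [
    ("incoming/2020-05-19/orders.txt", datetime(2020, 5, 19, 8, 0, tzinfo=timezone.utc)),
    ("incoming/2020-05-20/orders.txt", datetime(2020, 5, 20, 8, 0, tzinfo=timezone.utc)),
    ("incoming/2020-05-21/orders.txt", datetime(2020, 5, 21, 8, 0, tzinfo=timezone.utc)),
    ("incoming/2020-05-21/readme.md", datetime(2020, 5, 21, 9, 0, tzinfo=timezone.utc)),
    ("incoming/2020-05-21/", datetime(2020, 5, 21, 7, 0, tzinfo=timezone.utc)),
]


def select_new_files(entries, prefix, extension, last_processed):
    # Keep keys under the parent prefix, with the wanted extension,
    # that were modified after the last processed timestamp.
    selected = []
    for key, modified in entries:
        if not key.startswith(prefix):
            continue
        if not key.endswith(extension):
            continue  # also skips folder placeholder keys ending in "/"
        if modified > last_processed:
            selected.append(key)
    return selected


last_processed = datetime(2020, 5, 19, 12, 0, tzinfo=timezone.utc)
new_keys = select_new_files(listing, "incoming/", ".txt", last_processed)
```

&lt;BR /&gt;With these made-up values, new_keys contains only the two .txt keys modified after the cutoff.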
&lt;BR /&gt;
&lt;BR /&gt;Regards
&lt;BR /&gt;Shong
&lt;BR /&gt;</description>
      <pubDate>Thu, 21 May 2020 12:51:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267230#M46189</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2020-05-21T12:51:37Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267231#M46190</link>
      <description>&lt;P&gt;With the flow below, I am able to list the files and select particular files using the filter component. But it only lists them; I am not able to process the files in the job.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;tS3List -&amp;gt; tIterateToFlow -&amp;gt; tFilterRow -&amp;gt; tLogRow&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am not able to link tFilterRow to tS3Get. If I link them using OnComponentOk from tFilterRow, it picks up only the last file instead of all the required files. If I link tS3Get from tS3List, I get all the files instead of only the required ones.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please help.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 03:19:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267231#M46190</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2020-05-26T03:19:27Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267232#M46191</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LRAaAAO"&gt;@gopal16&lt;/A&gt;, after tLogRow, add a tJavaRow and store the filename in a context variable, then connect tS3Get to the tJavaRow using OnComponentOk, and you will get the required files.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 05:17:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267232#M46191</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2020-05-26T05:17:05Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267233#M46192</link>
      <description>&lt;P&gt;I am not able to link tJavaRow to tS3Get with a Main connection. Only OnComponentOk is allowed, and with that I get only the last file, not all the required files.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 05:20:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267233#M46192</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2020-05-26T05:20:41Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267234#M46193</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LRAaAAO"&gt;@gopal16&lt;/A&gt;, yes, you need to use OnComponentOk to connect. How many files was tLogRow getting?&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 05:25:34 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267234#M46193</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2020-05-26T05:25:34Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267235#M46194</link>
      <description>&lt;P&gt;tLogRow is getting 10+ records with filenames.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 05:26:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267235#M46194</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2020-05-26T05:26:46Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267236#M46195</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LRAaAAO"&gt;@gopal16&lt;/A&gt;, if that is the case, you will get all the files with the design I described. Please verify.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 05:33:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267236#M46195</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2020-05-26T05:33:38Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267237#M46196</link>
      <description>&lt;P&gt;Sorry Manohar. With your approach, I am getting only the last file instead of all 14 files. Below is a screenshot.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Sample1.JPG" style="width: 892px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MaFd.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/137582iEEA74465D4B3FDE5/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MaFd.jpg" alt="0683p000009MaFd.jpg" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
</description>
      <pubDate>Tue, 26 May 2020 05:54:11 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267237#M46196</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2020-05-26T05:54:11Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267238#M46197</link>
      <description>&lt;P&gt;I think I found the solution. The job below worked, and I am able to filter and get all the required files.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Sample1.JPG" style="width: 978px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MaFi.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154970iC8F67E2AEB669795/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MaFi.jpg" alt="0683p000009MaFi.jpg" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
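&lt;P&gt;As far as I understand it, the earlier attempt got only the last file because OnComponentOk fires once, after the whole flow has finished, so any variable set per row holds the last row's value by then. An iterate-style link runs the download once per row. A plain Python sketch of the two patterns (not Talend code; fetch is a hypothetical stand-in for tS3Get and the keys are made up):&lt;/P&gt;

```python
filtered_keys = ["a/file1.txt", "a/file2.txt", "a/file3.txt"]


def fetch(key, fetched):
    # Stand-in for tS3Get: record which key was requested.
    fetched.append(key)


# Pattern 1: a variable is overwritten for every row and the fetch is
# triggered once, after the whole flow has finished (like OnComponentOk).
once_fetched = []
current_key = None
for key in filtered_keys:
    current_key = key
fetch(current_key, once_fetched)

# Pattern 2: the fetch runs inside the loop, once per row (like an
# Iterate link feeding tS3Get).
per_row_fetched = []
for key in filtered_keys:
    fetch(key, per_row_fetched)
```

&lt;P&gt;In this sketch, once_fetched ends up with just the last key, while per_row_fetched contains every filtered key.&lt;/P&gt;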
&lt;P&gt;Now I need to work on loading these files directly into a table, without placing them on the local system, using the S3 Select option in tS3Get. If anyone has succeeded with the S3 Select option, please let me know.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 05:58:20 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267238#M46197</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2020-05-26T05:58:20Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267239#M46198</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LRAaAAO"&gt;@gopal16&lt;/A&gt;, you need to use tFlowToIterate to pass the files one by one to tS3Get.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 06:07:17 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267239#M46198</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2020-05-26T06:07:17Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267240#M46199</link>
      <description>&lt;P&gt;Yes, after using tFlowToIterate it worked, as mentioned previously. Now I need help reading the files that I got from tS3Get and loading them all into a table. All the files have the same structure, so they can be loaded into a single table.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;When I tried the job below, it threw the error: Duplicate nested type row2Struct. When I recompiled, it threw another error saying that one of the fields cannot be resolved or is not a field. I am not actually using that field in tFileInputDelimited; it is defined only in the tIterateToFlow component. If I remove the tFileInputDelimited and tDBOutput components from this job, it runs fine. I am not sure where the problem is.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Sample1.JPG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MZrN.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156506iAEEB6ED59527BA2C/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MZrN.jpg" alt="0683p000009MZrN.jpg" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="error.JPG" style="width: 610px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MaFe.jpg"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/136572iA026D9E401343A50/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MaFe.jpg" alt="0683p000009MaFe.jpg" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
</description>
      <pubDate>Tue, 26 May 2020 06:29:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267240#M46199</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2020-05-26T06:29:39Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267241#M46200</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LRAaAAO"&gt;@gopal16&lt;/A&gt;, so there is no issue up to tS3Get, right? It is a compilation issue; you might not have configured the components correctly.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 06:37:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267241#M46200</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2020-05-26T06:37:58Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267242#M46201</link>
      <description>&lt;P&gt;There are no issues up to tS3Get. When I added the tFileInputDelimited and tDBOutput components, it started throwing this error. In tFileInputDelimited, I set the file name to the same filename that I specify when getting the file with tS3Get. Then I simply connected tFileInputDelimited to tDBOutput to load those files. I didn't change any other configuration.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 06:55:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267242#M46201</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2020-05-26T06:55:13Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267243#M46202</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LRAaAAO"&gt;@gopal16&lt;/A&gt;, you need to give the local filename in tFileInputDelimited; do not use the global variables of tS3List. Please check.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 08:29:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267243#M46202</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2020-05-26T08:29:57Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267244#M46203</link>
      <description>&lt;P&gt;Yes, I am using only the local folder in the filename. In tFileInputDelimited I am giving the same filename that is set dynamically in the File option of tS3Get, as I can't hardcode the filename.&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 08:38:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267244#M46203</guid>
      <dc:creator>gopal16</dc:creator>
      <dc:date>2020-05-26T08:38:18Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267245#M46204</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LRAaAAO"&gt;@gopal16&lt;/A&gt;, can you share a screenshot of tFileInputDelimited?&lt;/P&gt;</description>
      <pubDate>Tue, 26 May 2020 09:19:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267245#M46204</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2020-05-26T09:19:44Z</dc:date>
    </item>
    <item>
      <title>Re: Retrieve the selected files from S3 bucket and process those in the job directly</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267246#M46205</link>
      <description>&lt;P&gt;@Manohar B: is there a way to get just the latest files from S3 if my client places all the files in the same S3 bucket? The file name changes every hour.&lt;/P&gt;</description>
      <pubDate>Wed, 29 Sep 2021 13:37:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Retrieve-the-selected-files-from-S3-bucket-and-process-those-in/m-p/2267246#M46205</guid>
      <dc:creator>sushantk19</dc:creator>
      <dc:date>2021-09-29T13:37:59Z</dc:date>
    </item>
  </channel>
</rss>

