<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Talend Big Data Batch - Reading from S3 in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Talend-Big-Data-Batch-Reading-from-S3/m-p/2225084#M17531</link>
    <description>&lt;P&gt;Guys, I've started my tests with Talend Big Data.&lt;/P&gt; 
&lt;P&gt;Now, specifically, I'm trying to read S3 csv file to a dataframe... I Want to try to merge this data with an existent parquet file on S3.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="1.PNG" style="width: 666px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8Oh.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/150260i37A803B08D4B358A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8Oh.png" alt="0683p000009M8Oh.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="2.PNG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8Or.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/147935i050E1B91EF28D141/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8Or.png" alt="0683p000009M8Or.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="3.PNG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8DG.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/148097i80F46491FE067AEF/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8DG.png" alt="0683p000009M8DG.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;The talend is returning the following error:&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="4.PNG" style="width: 900px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8P1.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/143709i5DD554D9AE80A836/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8P1.png" alt="0683p000009M8P1.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I saw once some article or person saying that on Talend Big Data is necessary download the file from S3 to HDFS firstly and after with the file inside hdfs is possible then use a Big Data Batch job to process the data. Is it correct? Would be possible do the way I'm trying or Should I try the second approach.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I found really difficult to find answers to this through the internet...&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;André Santos&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 03:45:43 GMT</pubDate>
    <dc:creator>adolitos</dc:creator>
    <dc:date>2024-11-16T03:45:43Z</dc:date>
    <item>
      <title>Talend Big Data Batch - Reading from S3</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-Big-Data-Batch-Reading-from-S3/m-p/2225084#M17531</link>
      <description>&lt;P&gt;Guys, I've started my tests with Talend Big Data.&lt;/P&gt; 
&lt;P&gt;Now, specifically, I'm trying to read S3 csv file to a dataframe... I Want to try to merge this data with an existent parquet file on S3.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="1.PNG" style="width: 666px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8Oh.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/150260i37A803B08D4B358A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8Oh.png" alt="0683p000009M8Oh.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="2.PNG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8Or.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/147935i050E1B91EF28D141/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8Or.png" alt="0683p000009M8Or.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="3.PNG" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8DG.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/148097i80F46491FE067AEF/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8DG.png" alt="0683p000009M8DG.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;The talend is returning the following error:&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="4.PNG" style="width: 900px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M8P1.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/143709i5DD554D9AE80A836/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M8P1.png" alt="0683p000009M8P1.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I saw once some article or person saying that on Talend Big Data is necessary download the file from S3 to HDFS firstly and after with the file inside hdfs is possible then use a Big Data Batch job to process the data. Is it correct? Would be possible do the way I'm trying or Should I try the second approach.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I found really difficult to find answers to this through the internet...&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;André Santos&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 03:45:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-Big-Data-Batch-Reading-from-S3/m-p/2225084#M17531</guid>
      <dc:creator>adolitos</dc:creator>
      <dc:date>2024-11-16T03:45:43Z</dc:date>
    </item>
  </channel>
</rss>

