Now, specifically, I'm trying to read S3 csv file to a dataframe... I Want to try to merge this data with an existent parquet file on S3.
The talend is returning the following error:
I saw once some article or person saying that on Talend Big Data is necessary download the file from S3 to HDFS firstly and after with the file inside hdfs is possible then use a Big Data Batch job to process the data. Is it correct? Would be possible do the way I'm trying or Should I try the second approach.
I found really difficult to find answers to this through the internet...