Hi experts,
Can anyone please help me understand how to move data from AWS S3 to an EMR cluster using Talend?
Also, if I have some zip files in S3 buckets, how can I unzip them using Talend before moving them to the EMR cluster (I assume EMR is the Hadoop cluster provided by Amazon)?
I am using Talend Open Studio for Big Data and running it on my local PC.
Regards
Mukesh
Hi,
If we understand your requirement correctly, you can use the tS3Get component to retrieve a file from Amazon S3. The workflow should be:
tS3Connection --> tS3Get (retrieve files from S3 to local) --> tFileUnarchive (unzip your files) --> EMR cluster (Amazon EMR)
Best regards
Sabrina
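
To make the unzip step in the middle of that workflow concrete, here is a minimal sketch in plain Python of what it amounts to once the files are on the local machine. This is not Talend code and not what tFileUnarchive does internally; the function name `unzip_all` and the folder layout (one sub-folder per archive) are assumptions made for illustration. Downloading from S3 and uploading to the cluster (for example with `hadoop fs -put`) are left out.

```python
import zipfile
from pathlib import Path


def unzip_all(download_dir, extract_dir):
    """Extract every .zip archive found in download_dir into extract_dir.

    Hypothetical helper: returns the sorted list of extracted file paths.
    """
    download_dir = Path(download_dir)
    extract_dir = Path(extract_dir)
    extract_dir.mkdir(parents=True, exist_ok=True)

    extracted = []
    for archive in sorted(download_dir.glob("*.zip")):
        # One sub-folder per archive so files with the same name do not collide.
        target = extract_dir / archive.stem
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
            # Skip directory entries; keep only real files.
            extracted.extend(
                str(target / name)
                for name in zf.namelist()
                if not name.endswith("/")
            )
    return sorted(extracted)
```

Calling `unzip_all("downloads", "unzipped")` would unpack every archive in a local `downloads` folder (both folder names are placeholders) and return the paths of the extracted files, which are the files that then need to be pushed to the cluster.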