PhilHibbs
Creator II

Fastest way to process S3 files?

I'm currently processing a lot of S3 files by downloading them with tS3Get, reading them as sequential files, writing out sequential files, and then uploading those files back to S3 with tS3Put.

 

Is there a better way - a way of reading the data directly from S3 without going to the local filesystem?

 

I have a similar process running in a Python script outside Talend, and I recently rewrote it to use a module called s3fs, which lets me treat S3 like a filesystem: I can open a file, read it, and write the result directly back to S3.
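For reference, a minimal sketch of the s3fs pattern described above. The bucket name, object keys, and the whitespace-stripping transform are placeholders for illustration, not the actual script, and credentials are assumed to come from the usual AWS environment or config files.

```python
import csv


def process_object(fs, src_path, dst_path):
    """Stream rows from one object to another without touching local disk.

    `fs` is any object with an fsspec-style open(path, mode) method,
    for example s3fs.S3FileSystem().
    """
    with fs.open(src_path, "r", newline="") as src, \
            fs.open(dst_path, "w", newline="") as dst:
        reader = csv.reader(src)
        writer = csv.writer(dst)
        for row in reader:
            # Placeholder transform: trim whitespace around each field.
            writer.writerow([field.strip() for field in row])


def run_against_s3():
    # Imported here so the helper above can be exercised without s3fs.
    import s3fs

    fs = s3fs.S3FileSystem()
    # Hypothetical bucket and keys; replace with real ones.
    process_object(fs, "my-bucket/input/data.csv", "my-bucket/output/data.csv")
```

Calling run_against_s3() reads the source object and writes the result row by row; the output is uploaded when the destination file is closed.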

 

Is there any way of doing this in Talend?

1 Reply
akumar2301
Creator III

Did you try Athena?

https://help.talend.com/reader/36FNvH4mND6uL3cKKzTU~w/2xTGZdvfowaLAOwQTtkNmA

 

It seems Athena can read S3 files directly.
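To illustrate the Athena suggestion, here is a rough Python sketch using the boto3 Athena client. It assumes a table has already been defined in Athena over the S3 location; the database, table, and output location shown in the usage comment are hypothetical.

```python
import time


def run_athena_query(client, sql, database, output_location, poll_seconds=1.0):
    """Start an Athena query and wait until it reaches a final state."""
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]
    while True:
        status = client.get_query_execution(QueryExecutionId=query_id)
        state = status["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            return query_id, state
        time.sleep(poll_seconds)


# Usage (hypothetical names; needs boto3 and AWS credentials):
#   import boto3
#   client = boto3.client("athena")
#   run_athena_query(client, "SELECT * FROM my_table LIMIT 10",
#                    "my_database", "s3://my-bucket/athena-results/")
```

Athena writes the query results to the given S3 output location, so nothing has to be downloaded to the local filesystem first.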