PhilHibbs
Creator II

Fastest way to process S3 files?

I'm currently processing a lot of S3 files by downloading them with tS3Get, reading them as sequential files, writing out sequential files, and then uploading those files back to S3 with tS3Put.

Is there a better way, one that reads the data directly from S3 without going through the local filesystem?

I have a similar process running in a Python script and not in Talend, and I recently rewrote it to use a module called s3fs that lets me treat s3 just like a filesystem so I can open the file, read it, and write directly back to S3.
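
For reference, the Python approach is roughly the following. This is a minimal sketch, not my actual script: the function names, the bucket paths in the usage note, and the `str.upper` placeholder transform are all made up for illustration, and it assumes the s3fs package is installed and AWS credentials are available in the environment.

```python
def copy_transformed(fs, src_path, dst_path, transform=str.upper):
    """Stream src_path line by line through `transform` into dst_path.

    `fs` is any object with an fsspec-style open(path, mode, ...) method,
    such as s3fs.S3FileSystem. This code does not stage anything on
    local disk itself. `transform` is a placeholder for the real
    per-line processing.
    """
    count = 0
    with fs.open(src_path, "r", encoding="utf-8") as reader, \
         fs.open(dst_path, "w", encoding="utf-8") as writer:
        for line in reader:
            writer.write(transform(line))
            count += 1
    return count


def run_on_s3(src_path, dst_path):
    """Hypothetical entry point: run the copy against real S3 objects."""
    import s3fs  # third-party package; imported lazily so the helper above stays testable

    fs = s3fs.S3FileSystem()  # picks up credentials from the environment
    return copy_transformed(fs, src_path, dst_path)
```

It would be called as something like `run_on_s3("my-bucket/in/data.csv", "my-bucket/out/data.csv")`, where both paths are placeholders. The point is that the same object key is opened, read, and written as if it were a local file, with no explicit download or upload step.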

Is there any way of doing this in Talend?

1 Reply
akumar2301
Specialist II

Did you try Athena?

https://help.talend.com/reader/36FNvH4mND6uL3cKKzTU~w/2xTGZdvfowaLAOwQTtkNmA

It seems Athena can read S3 files.