Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Qlik Open Lakehouse is Now Generally Available! Discover the key highlights and partner resources here.
cancel
Showing results for 
Search instead for 
Did you mean: 
PhilHibbs
Creator II
Creator II

Fastest way to process S3 files?

I'm currently processing a lot of S3 files by downloading them with tS3Get, reading them as sequential files, writing out sequential files, and then uploading those files back to S3 with tS3Put.

 

Is there a better way - a way of reading the data directly from S3 without going to the local filesystem?

 

I have a similar process running in a Python script and not in Talend, and I recently rewrote it to use a module called s3fs that lets me treat s3 just like a filesystem so I can open the file, read it, and write directly back to S3.

 

Is there any way of doing this in Talend?

Labels (3)
1 Reply
akumar2301
Specialist II
Specialist II

did you try Athena ? 

https://help.talend.com/reader/36FNvH4mND6uL3cKKzTU~w/2xTGZdvfowaLAOwQTtkNmA

 

It seems Athena can read S3 files.