Skip to main content
Announcements
July 15, NEW Customer Portal: Initial launch will improve how you submit Support Cases. IMPORTANT DETAILS
cancel
Showing results for 
Search instead for 
Did you mean: 
Parikhharshal
Creator III
Creator III

Convert .gz file to parquet or ORC format

Hi there

 

I am have bunch of .gz files stored on S3 bucket in AWS and want to convert them into Parquet format to achieve max performance for Redshift Spectrum.

 

Is it possible to do it thru Talend? And how can I do it?

 

Thanks

Harshal.

Labels (2)
1 Reply
manodwhb
Creator III
Creator III

You have to download .gz file from s3 using Talend and you need to read that .gz file and you need to convert.below is the link to read .gz file.
https://community.talend.com/t5/Design-and-Development/Unarchive-the-gzip-files/td-p/45659