Anonymous
Not applicable

Convert non-UTF8 encoded files to UTF-8 encoded format

Is there a way to convert non-UTF-8 encoded files to UTF-8 so that Hadoop can parse them efficiently,
and to replace the hexadecimal field and record delimiters with properly encoded UTF-8 ones? Could you illustrate this with an example?
6 Replies
Anonymous
Not applicable
Author

Hi,
Have you checked the tChangeFileEncoding component (TalendHelpCenter:tChangeFileEncoding)? It transforms the character encoding of a given file and generates a new file with the transformed character encoding.
Best regards
Sabrina
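For readers who want to see the idea outside of a Talend component, here is a minimal plain-Java sketch of the same kind of conversion. It is not how tChangeFileEncoding is implemented; the class name `EncodingConverter`, the method `convertToUtf8`, the ISO-8859-1 source encoding, and the 0x01 and tab delimiters are all assumptions chosen for illustration. The source encoding in particular has to be known in advance, since it cannot be detected reliably from the bytes.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

final class EncodingConverter {

    // Decodes `source` using `sourceCharset`, swaps one delimiter character
    // for another, and writes the result to `target` encoded as UTF-8.
    static void convertToUtf8(Path source, Charset sourceCharset, Path target,
                              char oldDelimiter, char newDelimiter) throws IOException {
        try (BufferedReader reader = Files.newBufferedReader(source, sourceCharset);
             BufferedWriter writer = Files.newBufferedWriter(target, StandardCharsets.UTF_8)) {
            char[] buffer = new char[8192];
            int read;
            // Stream in chunks so large files do not have to fit in memory.
            while ((read = reader.read(buffer)) != -1) {
                for (int i = 0; i < read; i++) {
                    if (buffer[i] == oldDelimiter) {
                        buffer[i] = newDelimiter;
                    }
                }
                writer.write(buffer, 0, read);
            }
        }
    }

    public static void main(String[] args) throws IOException {
        // Assumed setup: args[0] is an ISO-8859-1 file whose fields are
        // separated by the control character 0x01; args[1] is the output file.
        convertToUtf8(Paths.get(args[0]), StandardCharsets.ISO_8859_1,
                      Paths.get(args[1]), '\u0001', '\t');
    }
}
```

The same loop could replace a record delimiter as well. If the source file contains bytes that are invalid in the declared source charset, the reader throws an exception rather than silently writing damaged text, which is usually the safer behaviour for a one-off conversion.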
Anonymous
Not applicable
Author

Hi Guys,

 

I am extracting data from JSON files with the tFileInputJson component, but it is not extracting UTF-8 characters correctly when I print the output with tLogRow or write it with tMySQLOutput. I tried changing the encoding in the advanced settings, but that didn't work either.

Your help would be appreciated.

 

Thanks and Regards

Hitesh Mundra

Anonymous
Not applicable
Author

Hi Hitesh,

 

Did you change the file encoding for the component to UTF-8 as shown below?

0683p000009M2d9.png

 

Warm Regards,
Nikhil Thampi

Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved 🙂

Anonymous
Not applicable
Author

Hi @nthampi, I have already tried this, but it didn't work.

akumar2301
Specialist II

Did you already try tChangeFileEncoding?

Otherwise, try the routine attached in the link below:

https://community.talend.com/t5/Design-and-Development/tChangeFileEncoding-and-UTF8-encoding/m-p/149...

Otherwise, attach a sample file.
Anonymous
Not applicable
Author

Hi,

 

I solved the issue by simply reading the file with tFileInputRaw and writing it back with tFileOutputRaw using the UTF-8 character set.

 

Thanks and Regards

Hitesh Mundra
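A rough plain-Java analogue of the read-raw-then-write-as-UTF-8 approach described above might look like the following. This is only a sketch and not what tFileInputRaw or tFileOutputRaw do internally; the class name `RawRewrite`, the method `rewriteAsUtf8`, and the choice of source charset are assumptions for illustration.

```java
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

final class RawRewrite {

    // Loads the whole file into memory, decodes it with the charset the file
    // was actually written in, and writes the text back out as UTF-8 bytes.
    static void rewriteAsUtf8(Path source, Charset sourceCharset, Path target) throws IOException {
        byte[] raw = Files.readAllBytes(source);
        // Note: this constructor replaces undecodable bytes with U+FFFD
        // instead of failing, so a wrong sourceCharset can go unnoticed.
        String text = new String(raw, sourceCharset);
        Files.write(target, text.getBytes(StandardCharsets.UTF_8));
    }
}
```

Because the whole file is held in memory, this suits small to medium files such as individual JSON documents. Once the rewritten file is really UTF-8, a downstream reader configured for UTF-8 should see the characters as intended.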