Skip to main content
Announcements
A fresh, new look for the Data Integration & Quality forums and navigation! Read more about what's changed.
cancel
Showing results for 
Search instead for 
Did you mean: 
j_brunck
Contributor

Dfile.encoding=UTF-8 does not seem to be working well

we are running Talend Big data 6.2.1 and had an issue where we were receiving extended characters sets from source data, ASCII values around 200 or so when loading these they were showing as unknown characters. but only when running via a bat file from the build process, when running in Talend workspace they were ok. found we needed

Dfile.encoding=UTF-8 added in the bat file

parameters

. we were

successful

in loading these extended character sets for a very simple test job, but when we are running more complicated jobs that have multiple subjobs this now does not work, it is kind of like the parameter is not being passed through all the jobs. any thoughts?

 

I found that if i set the custom encoding to be UTF-8 in the tHDFSOutput component, this seems to work all the time. curious that the parameter is not passing correctly when not set though

Labels (3)
1 Reply
Anonymous
Not applicable

Hello,

Are you getting encoding issue when running it in studio or TAC, TMC and server?

For studio side, you can set it;

From the Job level:

Click the Run tab > Advanced settings > select Use specific JVM arguments > New > enter -Dfile.encoding=UTF-8.

Feel free to let us know if it helps.

Best regards

Sabrina