
-Dfile.encoding=UTF-8 does not seem to be working well
We are running Talend Big Data 6.2.1 and ran into an issue with extended characters in our source data (character codes around 200 or so): after loading, they showed up as unknown characters. This only happened when the job ran via a .bat file from the build process; when run in the Talend workspace they were fine. We found we needed to add -Dfile.encoding=UTF-8 to the parameters in the .bat file. With that, we successfully loaded these extended characters in a very simple test job, but in more complicated jobs that have multiple subjobs it does not work. It is as if the parameter is not being passed through to all the jobs. Any thoughts?
I found that if I set the custom encoding to UTF-8 in the tHDFSOutput component, it works every time. It is curious, though, that the parameter is not passed through correctly when the encoding is not set there.
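The tHDFSOutput observation fits how Java handles encodings in general: code that names its charset explicitly does not depend on -Dfile.encoding, while code that relies on the JVM default does. The following is only a minimal plain-Java sketch of that difference, not Talend's generated code; the class and method names are made up for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration class; not part of Talend.
public class ExplicitEncodingSketch {

    // Encodes with an explicitly named charset, so the result does not
    // depend on the JVM default or on -Dfile.encoding.
    static byte[] encodeExplicit(String text) {
        return text.getBytes(StandardCharsets.UTF_8);
    }

    // Encodes with whatever default charset this JVM is using.
    static byte[] encodeDefault(String text) {
        return text.getBytes(Charset.defaultCharset());
    }

    public static void main(String[] args) {
        String sample = "\u00C8"; // the character with code 200
        System.out.println("default charset: " + Charset.defaultCharset());
        System.out.println("explicit UTF-8 byte count: " + encodeExplicit(sample).length);
        System.out.println("default charset byte count: " + encodeDefault(sample).length);
    }
}
```

If the two byte counts differ on the machine that runs the .bat file, the JVM default there is not UTF-8, and any component that falls back to the default would write different bytes than one with a custom encoding set.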

Hello,
Are you seeing the encoding issue when running the job in Studio, or when running it from TAC, TMC, or a server?
On the Studio side, you can set the encoding at the job level:
Click the Run tab > Advanced settings > select Use specific JVM arguments > New > enter -Dfile.encoding=UTF-8.
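To check whether the argument actually reached a given JVM, one option is to print the effective settings at the start of the job. This is a hedged sketch in plain Java with a made-up class name; the assumption is that the body of `describe()` could equally be pasted into a custom code component in each job of the chain, which would show which jobs see the setting and which do not.

```java
import java.nio.charset.Charset;

// Hypothetical helper; the class name is illustrative only.
public class EncodingCheck {

    // Reports the file.encoding system property and the default charset
    // that this JVM is actually using.
    static String describe() {
        return "file.encoding=" + System.getProperty("file.encoding")
                + ", defaultCharset=" + Charset.defaultCharset().name();
    }

    public static void main(String[] args) {
        System.out.println(describe());
    }
}
```

Running it once with and once without -Dfile.encoding=UTF-8 on the command line makes any difference visible in the output.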
Feel free to let us know if it helps.
Best regards
Sabrina
