I have a Java library that consumes incoming files. The problem is that it only works if the file is UTF-8 encoded. Is there a component or a best practice for checking this in Talend? I want to reject files that are not UTF-8 encoded.
Thanks
Hello,
So far, there is no component or built-in function that can detect a file's encoding. You could write a routine in Talend to check the file encoding.
Best regards
Sabrina
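A routine of the kind suggested above could be sketched with the JDK alone, without any detection library. This is only a minimal sketch under assumptions: the class name `Utf8Check` and its method names are made up for illustration, and in a Talend job the class would normally be declared `public` inside the `routines` package. It does not detect the encoding; it only checks whether the bytes decode as strict UTF-8, so a pure ASCII file also passes.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper class; the name and methods are illustrative only.
final class Utf8Check {

    private Utf8Check() {
    }

    // Returns true if the whole stream decodes as well-formed UTF-8.
    static boolean isValidUtf8(InputStream in) throws IOException {
        // REPORT makes the decoder throw on invalid bytes instead of
        // silently replacing them with U+FFFD (the default behaviour).
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try (Reader reader = new InputStreamReader(in, decoder)) {
            char[] buffer = new char[8192];
            while (reader.read(buffer) != -1) {
                // Decoded characters are discarded; only validity matters.
            }
            return true;
        } catch (CharacterCodingException e) {
            // Thrown when a byte sequence is not valid UTF-8.
            return false;
        }
    }

    // Convenience overload for a file path, e.g. a Talend context variable.
    static boolean isValidUtf8(String path) throws IOException {
        try (InputStream in = Files.newInputStream(Paths.get(path))) {
            return isValidUtf8(in);
        }
    }
}
```

In a job, a tJava step could call `Utf8Check.isValidUtf8(path)` and throw an exception when it returns false, so the file is routed to a reject flow. A file in another encoding whose bytes happen to form valid UTF-8 sequences would still pass, which is uncommon for real text but possible.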
Thanks for the reply.
In case someone has the same problem: I solved it with an external JAR, juniversalchardet,
using its org.mozilla.universalchardet.UniversalDetector class:
https://github.com/albfernandez/juniversalchardet
It worked quite well. I added a Java routine and call it in my job; if it does not return true, I throw an exception and the file is handled differently.
Hello,
Great that it works. Thanks for sharing it with the community.
Best regards
Sabrina