<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: [resolved] Encoding issue with tfileOutputDelimited in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280845#M55572</link>
    <description>Finally... Produced file is UTF-8, as expected, but without BOM.&lt;BR /&gt;My bad, the default configuration for my tool (ultraedit32) was to translate file to UTF when recognized as such, showing wrong BOM in my case (and adding it if saved).&lt;BR /&gt;I'll set this post as resolved.</description>
    <pubDate>Thu, 16 Jun 2011 10:43:38 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2011-06-16T10:43:38Z</dc:date>
    <item>
      <title>[resolved] Encoding issue with tfileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280842#M55569</link>
      <description>I use a tfileOutputDelimited with encoding set on UTF-8 (default) in advanced parameters.
&lt;BR /&gt;Nevertheless the produced file has an UTF-16LE BOM (FF FE) with UTF-16LE character encoding.
&lt;BR /&gt;I tried to pipe a tChangeFileEncoding (UTF16-&amp;gt;UTF8) with and without custom input encoding.
&lt;BR /&gt;Both tests failed, i'm stuck with utf16.
&lt;BR /&gt;Any idea ?
&lt;BR /&gt;Frankie.
&lt;BR /&gt;BTW : I use TOS Version: 4.1.2
&lt;BR /&gt;Build id: r53616-20110106-0635</description>
      <pubDate>Wed, 15 Jun 2011 10:03:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280842#M55569</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-06-15T10:03:29Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Encoding issue with tfileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280843#M55570</link>
      <description>Hi
&lt;BR /&gt;Can you send me an example file for testing? 
&lt;BR /&gt;Best regards
&lt;BR /&gt;Shong</description>
      <pubDate>Wed, 15 Jun 2011 15:24:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280843#M55570</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-06-15T15:24:37Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Encoding issue with tfileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280844#M55571</link>
      <description>Seems it is related a " Will Not Fix" Java Bug. 
&lt;BR /&gt; 
&lt;A href="http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4508058" rel="nofollow noopener noreferrer"&gt;http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4508058&lt;/A&gt; 
&lt;BR /&gt;A quick and dirty workaround (in a tJava) : 
&lt;BR /&gt; 
&lt;PRE&gt;//reading as UTF-16LE&lt;BR /&gt;FileInputStream fis = new FileInputStream("inpufile.txt");&lt;BR /&gt;BufferedReader r = new BufferedReader(new InputStreamReader(fis, "UTF-16LE"));&lt;BR /&gt;//writing as UTF-8&lt;BR /&gt;FileOutputStream fos = new FileOutputStream("ouputfile.txt");&lt;BR /&gt;Writer w = new BufferedWriter(new OutputStreamWriter(fos, "UTF-8"));&lt;BR /&gt;//copy data&lt;BR /&gt;for (String s = ""; (s = r.readLine()) != null;) {&lt;BR /&gt;	w.write(s + System.getProperty("line.separator"));&lt;BR /&gt;	w.flush();&lt;BR /&gt;}&lt;BR /&gt;//closing streams&lt;BR /&gt;w.close();&lt;BR /&gt;r.close();&lt;/PRE&gt; 
&lt;BR /&gt;The BOM is still wrong, but the encoding is right. 
&lt;BR /&gt;I did not find a convenient way to put binary files online. 
&lt;BR /&gt;So here is a small example of what I mean : 
&lt;BR /&gt;- Actual input data (readable) : ? 
&lt;BR /&gt; (LATIN CAPITAL LETTER A WITH DIAERESIS + DEGREE SIGN) 
&lt;BR /&gt;- Correct UTF-16LE (hexa) : FF FE 00 C4 00 B0 
&lt;BR /&gt; as written by talend in my case (supposed to be utf8) 
&lt;BR /&gt;- Actual output file (hexa mixed) : FF FE C3 84 C2 B0 
&lt;BR /&gt; after the above quick'n dirty conversion 
&lt;BR /&gt;- Expected output (hexa utf8) : EF BB BF C3 84 C2 B0 
&lt;BR /&gt;Edit : Oops seems that ultraedit converts automatically to utf16 when opening. Trying with a decent binary viwer/editor now.</description>
      <pubDate>Thu, 16 Jun 2011 09:52:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280844#M55571</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-06-16T09:52:18Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Encoding issue with tfileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280845#M55572</link>
      <description>Finally... Produced file is UTF-8, as expected, but without BOM.&lt;BR /&gt;My bad, the default configuration for my tool (ultraedit32) was to translate file to UTF when recognized as such, showing wrong BOM in my case (and adding it if saved).&lt;BR /&gt;I'll set this post as resolved.</description>
      <pubDate>Thu, 16 Jun 2011 10:43:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280845#M55572</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-06-16T10:43:38Z</dc:date>
    </item>
    <item>
      <title>Re: [resolved] Encoding issue with tfileOutputDelimited</title>
      <link>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280846#M55573</link>
      <description>&lt;P&gt;Hi &lt;BR /&gt;Glad to see that you find the cause! Maybe you can try this component &lt;A href="https://exchange.talend.com/#marketplacesearch:gallery=marketplace%252F1&amp;amp;ob=releaseDate&amp;amp;o=0&amp;amp;c=20&amp;amp;d=true&amp;amp;s=tWriteHeaderLineToFileWithBOM&amp;amp;f=&amp;amp;tf=&amp;amp;bf=&amp;amp;cat=&amp;amp;title=Search%2520Results" target="_blank" rel="nofollow noopener noreferrer"&gt;tWriteHeaderLineToFileWithBOM&lt;/A&gt; to output the records with BOM.&lt;BR /&gt;&lt;BR /&gt;Best regards&lt;BR /&gt;Shong&lt;/P&gt;</description>
      <pubDate>Thu, 16 Jun 2011 11:18:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/resolved-Encoding-issue-with-tfileOutputDelimited/m-p/2280846#M55573</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-06-16T11:18:46Z</dc:date>
    </item>
  </channel>
</rss>

