<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: tChangeFileEncoding and UTF8 encoding in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313283#M84153</link>
    <description>&lt;P&gt;If anyone ever has the same problem, here is how I solved mine. As a reminder, I needed to change the encoding of a file.csv from ANSI to UTF-8.&amp;nbsp;And I also had a problem with my UTF-8 file when I opened it with Excel.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;First things first,&amp;nbsp;it is apparently well known that excel has trouble dealing with files.csv in UTF-8. (&lt;A href="https://stackoverflow.com/questions/6002256/is-it-possible-to-force-excel-recognize-utf-8-csv-files-automatically" target="_self" rel="nofollow noopener noreferrer"&gt;example here&lt;/A&gt;). And since, the file didn't have to be used in Excel in the end, I just ignore that part.&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Secondly, I found that my file was not encoded in ISO-8859-15 (aka Latin-9) as I thought it was natively but in Latin-1.&amp;nbsp;I tried using the option "Custom" encoding from the tFileChangeEncoding to do the job, but it was not as intuitive as I thought it would be. So I used a tJava component + a custom routine to solve this problem. For the routine, I used the java.NIO library and I found &lt;A href="https://docs.oracle.com/javase/8/docs/technotes/guides/intl/encoding.doc.html" target="_self" rel="nofollow noopener noreferrer"&gt;here&lt;/A&gt; all the encoding supported by this library. My encoding is/was "windows-1252".&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;After that, I simply had to call my routine like :&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;myPackage.MyCustomRoutine.myMethod&amp;nbsp;(input_encoding, output_encoding, input_directory+input_filename, out&lt;SPAN&gt;put_&lt;/SPAN&gt;&lt;SPAN&gt;directory+output_filename&lt;/SPAN&gt;);&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 29 May 2018 11:03:12 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2018-05-29T11:03:12Z</dc:date>
    <item>
      <title>tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313276#M84146</link>
      <description>&lt;P&gt;Hello,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have an input_file encoded in ANSI that I want to encode to UTF-8.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So basically, I use the&amp;nbsp;tChangeFileEncoding component and&amp;nbsp;I do get an output_file encoded in UTF-8. While I open it with notepad++, everything is alright.&amp;nbsp;&lt;/P&gt;&lt;P&gt;But when i open it with Excel, "€" and "é" caracters show me things like "â‚¬_" and "Ã©".&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is there any way to fix this ?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 07 May 2018 10:53:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313276#M84146</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-05-07T10:53:47Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313277#M84147</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LONOAA4"&gt;@JoshyBrown&lt;/A&gt;,what type of file are you using to change the coding?&lt;/P&gt;</description>
      <pubDate>Mon, 07 May 2018 10:58:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313277#M84147</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2018-05-07T10:58:18Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313278#M84148</link>
      <description>&lt;P&gt;It's a .csv file.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 07 May 2018 11:00:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313278#M84148</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-05-07T11:00:54Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313279#M84149</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LONOAA4"&gt;@JoshyBrown&lt;/A&gt;,based on the encoding those characters will be converted into special characters.&lt;/P&gt;</description>
      <pubDate>Mon, 07 May 2018 11:22:19 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313279#M84149</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2018-05-07T11:22:19Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313280#M84150</link>
      <description>&lt;A href="https://community.qlik.com/s/profile/0053p000007LKmJAAW"&gt;@manodwhb&lt;/A&gt;, Is there a way to change/by pass that and obtain a proper .csv file when opened with excel ?</description>
      <pubDate>Mon, 07 May 2018 11:47:20 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313280#M84150</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-05-07T11:47:20Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313281#M84151</link>
      <description>&lt;P&gt;I started to get a grasp on your awnser and the solution to fix my problem is to use the &lt;A href="https://en.wikipedia.org/wiki/Byte_order_mark" target="_self" rel="nofollow noopener noreferrer"&gt;BOM&lt;/A&gt;. 
 &lt;STRIKE&gt;
   Unfortunately, while using tFileChangeEncoding and indicating "UTF-8-BOM", Talend can not recognize it and therefore deliver a proper output file.&amp;nbsp; 
 &lt;/STRIKE&gt;&lt;/P&gt; 
&lt;P&gt; 
 &lt;STRIKE&gt;
   Anyone knows how to use the BOM in Talend ? Or use the custom encoding option ?&amp;nbsp; 
 &lt;/STRIKE&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;*edit*&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Ok, it's not how it works. I have found &lt;A href="https://community.qlik.com/s/feed/0D53p00007vCn5aCAC" target="_self"&gt;this topic&lt;/A&gt; which is related to my problem. Apparently, I need to use a custom component in order to use BOM. BOM is not native on Talend. But maybe the previous topic is too old. I can't find the&amp;nbsp;&lt;A href="http://www.talendforge.org/exchange/tos/extension_view.php?eid=328" target="_blank" rel="nofollow noopener noreferrer noopener noreferrer"&gt;tWriteHeaderLineToFileWithBOM&lt;/A&gt;&amp;nbsp;component. Is there a way to download it or did the OP retrieve it ?&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;The key to my problem is the BOM. I'm sure of it. Once I can download, install and use that custom component, my problem will be solved.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 09 May 2018 14:17:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313281#M84151</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-05-09T14:17:40Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313282#M84152</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;Could you please refer to this link about:&lt;A title="https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&amp;amp;p=marketplace%252F1%252Fproducts%252F257&amp;amp;pi=marketplace%252F1%252Fproducts%252F257%252Fitems%252F363" href="https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&amp;amp;p=marketplace%252F1%252Fproducts%252F257&amp;amp;pi=marketplace%252F1%252Fproducts%252F257%252Fitems%252F363" target="_self" rel="nofollow noopener noreferrer"&gt;https://exchange.talend.com/#marketplaceproductoverview:marketplace=marketplace%252F1&amp;amp;p=marketplace%252F1%252Fproducts%252F257&amp;amp;pi=marketplace%252F1%252Fproducts%252F257%252Fitems%252F363&lt;/A&gt;?&lt;/P&gt;
&lt;P&gt;And feel free to let us know if you can download this custom component from talend exchange portal.&lt;/P&gt;
&lt;P&gt;Best regards&lt;/P&gt;
&lt;P&gt;Sabrina&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 May 2018 04:49:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313282#M84152</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-05-11T04:49:05Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313283#M84153</link>
      <description>&lt;P&gt;If anyone ever has the same problem, here is how I solved mine. As a reminder, I needed to change the encoding of a file.csv from ANSI to UTF-8.&amp;nbsp;And I also had a problem with my UTF-8 file when I opened it with Excel.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;First things first,&amp;nbsp;it is apparently well known that excel has trouble dealing with files.csv in UTF-8. (&lt;A href="https://stackoverflow.com/questions/6002256/is-it-possible-to-force-excel-recognize-utf-8-csv-files-automatically" target="_self" rel="nofollow noopener noreferrer"&gt;example here&lt;/A&gt;). And since, the file didn't have to be used in Excel in the end, I just ignore that part.&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Secondly, I found that my file was not encoded in ISO-8859-15 (aka Latin-9) as I thought it was natively but in Latin-1.&amp;nbsp;I tried using the option "Custom" encoding from the tFileChangeEncoding to do the job, but it was not as intuitive as I thought it would be. So I used a tJava component + a custom routine to solve this problem. For the routine, I used the java.NIO library and I found &lt;A href="https://docs.oracle.com/javase/8/docs/technotes/guides/intl/encoding.doc.html" target="_self" rel="nofollow noopener noreferrer"&gt;here&lt;/A&gt; all the encoding supported by this library. My encoding is/was "windows-1252".&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;After that, I simply had to call my routine like :&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;myPackage.MyCustomRoutine.myMethod&amp;nbsp;(input_encoding, output_encoding, input_directory+input_filename, out&lt;SPAN&gt;put_&lt;/SPAN&gt;&lt;SPAN&gt;directory+output_filename&lt;/SPAN&gt;);&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 29 May 2018 11:03:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313283#M84153</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-05-29T11:03:12Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313284#M84154</link>
      <description>&lt;P&gt;Hello Joshy,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please, can you share your routine please ?&lt;BR /&gt;&lt;BR /&gt;Thanks you !!&lt;/P&gt;</description>
      <pubDate>Sun, 27 Jan 2019 09:24:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313284#M84154</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-27T09:24:40Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313285#M84155</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/0053p000007LPCqAAO"&gt;@sasafca&lt;/A&gt;&amp;nbsp;,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You'll find it in the join piece to this message.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hoping it will help.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;BR /&gt;&lt;A href="https://community.qlik.com/legacyfs/online/tlnd_dw_files/0683p000009LukA"&gt;routine_encoding.txt&lt;/A&gt;</description>
      <pubDate>Mon, 28 Jan 2019 08:51:36 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313285#M84155</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-28T08:51:36Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313286#M84156</link>
      <description>&lt;P&gt;Hello,&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/0053p000007LONOAA4"&gt;@JoshyBrown&lt;/A&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You are a Genius !!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks you so much !&lt;/P&gt;</description>
      <pubDate>Tue, 29 Jan 2019 10:37:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313286#M84156</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-29T10:37:01Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313287#M84157</link>
      <description>&lt;P&gt;I'll take the compliment even though this problem gave me quite a hard time&amp;nbsp;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACJ.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/133049iD780B7DE0116E4D1/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACJ.png" alt="0683p000009MACJ.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Jan 2019 10:40:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313287#M84157</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-29T10:40:15Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313288#M84158</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LONOAA4"&gt;@JoshyBrown&lt;/A&gt;, do you know how to convert CSV UTF-8 to UTF-8-BOM please ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;My client need a CSV in UTF-8... with BOM for special caraters... (not "&lt;SPAN&gt;windows-1252"&lt;/SPAN&gt;&amp;nbsp;&lt;SPAN class="tw-bilingual-translation"&gt;regrettably)&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="tw-bilingual-translation"&gt;Thanks you &lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Jan 2019 13:04:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313288#M84158</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-29T13:04:39Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313289#M84159</link>
      <description>Sorry, that I don't know. I didn't have to go through this problem.</description>
      <pubDate>Tue, 29 Jan 2019 13:25:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313289#M84159</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-01-29T13:25:00Z</dc:date>
    </item>
    <item>
      <title>Re: tChangeFileEncoding and UTF8 encoding</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313290#M84160</link>
      <description>&lt;P&gt;Hi @David Joshua​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Can you please give details for tjava component custom routine?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 24 May 2022 05:35:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tChangeFileEncoding-and-UTF8-encoding/m-p/2313290#M84160</guid>
      <dc:creator>CDol1613983942</dc:creator>
      <dc:date>2022-05-24T05:35:43Z</dc:date>
    </item>
  </channel>
</rss>

