<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Encoding issue with tFileOutputXML in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258565#M40329</link>
    <description>€ is not part of the 8859-1 character set and so can never be written to a file encoded in 8859-1.
&lt;BR /&gt;
&lt;BLOCKQUOTE&gt;
 &lt;TABLE border="1"&gt;
  &lt;TBODY&gt;
   &lt;TR&gt;
    &lt;TD&gt;I expect Talend to encode it to "€"&lt;/TD&gt;
   &lt;/TR&gt;
  &lt;/TBODY&gt;
 &lt;/TABLE&gt;
&lt;/BLOCKQUOTE&gt;
&lt;BR /&gt;What? How exactly do you want Talend to change that? As I said, it is not part of the character set.
&lt;BR /&gt;If you want €, either use 8859-15 or change to UTF-8.</description>
    <pubDate>Thu, 23 Aug 2012 14:07:57 GMT</pubDate>
    <dc:creator>_AnonymousUser</dc:creator>
    <dc:date>2012-08-23T14:07:57Z</dc:date>
    <item>
      <title>Encoding issue with tFileOutputXML</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258563#M40327</link>
      <description>Hello, 
&lt;BR /&gt;I'm using a tFileOutputXML to write a simple XML file. I must use ISO-8859-1 as the encoding, and this works well if I set it as a Custom encoding in Advanced Options. But if there is a character outside ISO-8859-1 (for instance "€"), Talend just outputs "?".
&lt;BR /&gt;I expect Talend to encode it as "&amp;amp;#8364;": this is correctly decoded back to "€" when I use a tFileInputXML, so why is this behavior not consistent?
&lt;BR /&gt;A workaround is to set UTF-8 encoding on the tFileOutputXML and then use a transformation to get the XML into the mandatory encoding.
&lt;BR /&gt;Has anyone had the same issue? Do you think a bug report or enhancement request for this has any chance of getting some attention?
&lt;BR /&gt;Regards,
&lt;BR /&gt;Eric
&lt;BR /&gt;Edit: I'm using Talend 5.0.1</description>
      <pubDate>Tue, 21 Aug 2012 13:43:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258563#M40327</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-08-21T13:43:50Z</dc:date>
    </item>
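    <!--
Editorial sketch, not part of the thread: the workaround described in the post above (write UTF-8, then transform the XML into the mandatory encoding) might look like this in Java. It assumes the JDK's built-in JAXP identity transformer, whose serializer writes characters that the target encoding cannot represent as numeric character references. The class name XmlReencoder and the sample element are made up for illustration; nothing here is a Talend API.

```java
import java.io.ByteArrayOutputStream;
import java.io.StringReader;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical helper: re-serializes an XML document in another encoding.
final class XmlReencoder {

    // Parses the XML text and writes it back out in targetEncoding.
    // Characters outside the target charset are left to the serializer,
    // which is assumed to emit them as numeric character references.
    static byte[] reencode(String xml, String targetEncoding) throws TransformerException {
        // newTransformer() with no stylesheet gives an identity transform.
        Transformer identity = TransformerFactory.newInstance().newTransformer();
        identity.setOutputProperty(OutputKeys.ENCODING, targetEncoding);

        ByteArrayOutputStream out = new ByteArrayOutputStream();
        identity.transform(new StreamSource(new StringReader(xml)), new StreamResult(out));
        return out.toByteArray();
    }
}
```

Calling XmlReencoder.reencode on the text of the UTF-8 file with "ISO-8859-1" as the target should give bytes that a legacy ISO-8859-1 reader can parse, with the euro sign carried as a character reference rather than a raw byte.
    -->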
    <item>
      <title>Re: Encoding issue with tFileOutputXML</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258564#M40328</link>
      <description>Hi
&lt;BR /&gt;You have to use UTF-8 to read or write the special character "€". I don't think you can read it correctly from a file without UTF-8 encoding.</description>
      <pubDate>Thu, 23 Aug 2012 13:41:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258564#M40328</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-08-23T13:41:15Z</dc:date>
    </item>
    <item>
      <title>Re: Encoding issue with tFileOutputXML</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258565#M40329</link>
      <description>€ is not part of the 8859-1 character set and so can never be written to a file encoded in 8859-1.
&lt;BR /&gt;
&lt;BLOCKQUOTE&gt;
 &lt;TABLE border="1"&gt;
  &lt;TBODY&gt;
   &lt;TR&gt;
    &lt;TD&gt;I expect Talend to encode it to "€"&lt;/TD&gt;
   &lt;/TR&gt;
  &lt;/TBODY&gt;
 &lt;/TABLE&gt;
&lt;/BLOCKQUOTE&gt;
&lt;BR /&gt;What? How exactly do you want Talend to change that? As I said, it is not part of the character set.
&lt;BR /&gt;If you want €, either use 8859-15 or change to UTF-8.</description>
      <pubDate>Thu, 23 Aug 2012 14:07:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258565#M40329</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2012-08-23T14:07:57Z</dc:date>
    </item>
    <item>
      <title>Re: Encoding issue with tFileOutputXML</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258566#M40330</link>
      <description>Sorry, the forum broke everything. I'll edit my post: of course a € in a file encoded in 8859-1 wouldn't be possible; what I meant is "&amp;amp;#8364;".&lt;BR /&gt;I can't control the encoding, because I'm writing this file for a legacy app. Even ISO-8859-15 would be enough, but I simply can't change it.</description>
      <pubDate>Thu, 23 Aug 2012 16:54:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258566#M40330</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-08-23T16:54:56Z</dc:date>
    </item>
    <item>
      <title>Re: Encoding issue with tFileOutputXML</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258567#M40331</link>
      <description>OK, yes, that makes more sense. Talend has no built-in way to do this. The best thing would be to set up a new routine and use this (not tested by me):
&lt;BR /&gt;
&lt;A href="http://stackoverflow.com/questions/1273986/converting-utf-8-to-iso-8859-1-in-java" rel="nofollow noopener noreferrer"&gt;http://stackoverflow.com/questions/1273986/converting-utf-8-to-iso-8859-1-in-java&lt;/A&gt;</description>
      <pubDate>Fri, 24 Aug 2012 08:06:11 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258567#M40331</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2012-08-24T08:06:11Z</dc:date>
    </item>
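    <!--
Editorial sketch, not part of the thread: a routine along the lines suggested in the post above could replace every character the target charset cannot encode with a numeric character reference. The class and method names (Latin1Escaper, escapeUnmappable) are hypothetical and this is not a Talend API. It also does not escape markup characters, and if the component that later writes the file escapes the ampersand, the references would end up double-escaped.

```java
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;

// Hypothetical routine: keeps characters the charset can encode and
// rewrites everything else as a decimal numeric character reference.
final class Latin1Escaper {

    static String escapeUnmappable(String text, String charsetName) {
        CharsetEncoder encoder = Charset.forName(charsetName).newEncoder();
        StringBuilder out = new StringBuilder(text.length());

        int i = 0;
        while (i < text.length()) {
            // Work on code points so surrogate pairs stay together.
            int codePoint = text.codePointAt(i);
            String ch = new String(Character.toChars(codePoint));

            if (encoder.canEncode(ch)) {
                out.append(ch);
            } else {
                // Example: the euro sign U+20AC becomes &#8364;
                out.append("&#").append(codePoint).append(';');
            }
            i += Character.charCount(codePoint);
        }
        return out.toString();
    }
}
```

In a job this would be applied to each string column before the output component, for example Latin1Escaper.escapeUnmappable(value, "ISO-8859-1"), where value stands for whatever column holds the text.
    -->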
    <item>
      <title>Re: Encoding issue with tFileOutputXML</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258568#M40332</link>
      <description>&lt;BLOCKQUOTE&gt;
 &lt;TABLE border="1"&gt;
  &lt;TBODY&gt;
   &lt;TR&gt;
    &lt;TD&gt;OK, yes, that makes more sense. Talend has no built-in way to do this. The best thing would be to set up a new routine and use this (not tested by me):&lt;BR /&gt;&lt;A href="http://stackoverflow.com/questions/1273986/converting-utf-8-to-iso-8859-1-in-java" rel="nofollow noopener noreferrer"&gt;http://stackoverflow.com/questions/1273986/converting-utf-8-to-iso-8859-1-in-java&lt;/A&gt;&lt;/TD&gt;
   &lt;/TR&gt;
  &lt;/TBODY&gt;
 &lt;/TABLE&gt;
&lt;/BLOCKQUOTE&gt;
&lt;BR /&gt;Won't the &amp;amp; be escaped again when I write it with the tFileOutputXML, ruining this improvised encoding?</description>
      <pubDate>Fri, 24 Aug 2012 11:40:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Encoding-issue-with-tFileOutputXML/m-p/2258568#M40332</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-08-24T11:40:59Z</dc:date>
    </item>
  </channel>
</rss>

