<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Umlauts in UTF-8 in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296100#M68798</link>
    <description>Hi Jonathan,
&lt;BR /&gt;I think the output of the job should be in the format you specify in the output component. So if your input contains UTF-8 and you read this into Talend it will convert it to an internal format, but when exporting, you should be able to select the desired format again, UTF-8 for example. This should give you a file or table with the correct data.
&lt;BR /&gt;Hope this helps.
&lt;BR /&gt;Regards,
&lt;BR /&gt;Arno</description>
    <pubDate>Thu, 24 May 2012 11:45:26 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2012-05-24T11:45:26Z</dc:date>
    <item>
      <title>Umlauts in UTF-8</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296098#M68796</link>
      <description>Hello&lt;BR /&gt;I'm transforming an XML file. The source file has umlaut characters which have been converted to their utf-8 equivalents. For example the source file contains:&lt;BR /&gt;&amp;lt;TOWNCITY&amp;gt;Düsseldorf&amp;lt;/TOWNCITY&amp;gt;&lt;BR /&gt;When I transform this into an new XML format, TOS converts this to:&lt;BR /&gt;&amp;lt;TOWNCITY&amp;gt;Düsseldorf&amp;lt;/TOWNCITY&amp;gt;&lt;BR /&gt;I would like to preserve the original, but I can't figure it out. Both the source and the output file are configured to be UTF-8 encoding.&lt;BR /&gt;Any ideas how I can achieve this?&lt;BR /&gt;Thanks for your help.</description>
      <pubDate>Thu, 24 May 2012 11:02:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296098#M68796</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-05-24T11:02:40Z</dc:date>
    </item>
    <item>
      <title>Re: Umlauts in UTF-8</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296099#M68797</link>
      <description>Hmmm.... the character has been converted in the post too. So the original should be:&lt;BR /&gt;&amp;lt;TOWNCITY&amp;gt;D &amp;amp; # 2 5 2 ; sseldorf&amp;lt;/TOWNCITY&amp;gt;&lt;BR /&gt;With spaces, so that it does not convert</description>
      <pubDate>Thu, 24 May 2012 11:03:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296099#M68797</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-05-24T11:03:51Z</dc:date>
    </item>
    <item>
      <title>Re: Umlauts in UTF-8</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296100#M68798</link>
      <description>Hi Jonathan,
&lt;BR /&gt;I think the output of the job should be in the format you specify in the output component. So if your input contains UTF-8 and you read this into Talend it will convert it to an internal format, but when exporting, you should be able to select the desired format again, UTF-8 for example. This should give you a file or table with the correct data.
&lt;BR /&gt;Hope this helps.
&lt;BR /&gt;Regards,
&lt;BR /&gt;Arno</description>
      <pubDate>Thu, 24 May 2012 11:45:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296100#M68798</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-05-24T11:45:26Z</dc:date>
    </item>
    <item>
      <title>Re: Umlauts in UTF-8</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296101#M68799</link>
      <description>Arno
&lt;BR /&gt;Thanks for the reply. I'm doing as you suggest - the source file is read as UTF-8 and the output I create is also UTF-8, but Talend is still converting the data to the umlaut character. Maybe its a bug - I can't find any configuration parameters that will change this.
&lt;BR /&gt;Jonathan</description>
      <pubDate>Thu, 24 May 2012 11:53:19 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296101#M68799</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-05-24T11:53:19Z</dc:date>
    </item>
    <item>
      <title>Re: Umlauts in UTF-8</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296102#M68800</link>
      <description>If it's a bug you could get round it by replacing in a tMap or tReplace but it will probably affect a number of characters.</description>
      <pubDate>Thu, 24 May 2012 12:00:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296102#M68800</guid>
      <dc:creator>janhess</dc:creator>
      <dc:date>2012-05-24T12:00:12Z</dc:date>
    </item>
    <item>
      <title>Re: Umlauts in UTF-8</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296103#M68801</link>
      <description>Yes - I tried to post process it with a tReplace - no luck with this either I'm afraid - it still converts back to the umlaut character.</description>
      <pubDate>Thu, 24 May 2012 12:48:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Umlauts-in-UTF-8/m-p/2296103#M68801</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-05-24T12:48:00Z</dc:date>
    </item>
  </channel>
</rss>

