<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: !!! Fighting hard with control characters !!! in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341574#M109516</link>
    <description>Hi Pedro! 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;This is what it looks like: 
&lt;BR /&gt;Part of the data from MySQL: "... continued its mission..." - I believe the strange character getS encoded to "" when using a tAdvancedFileOutput component. I'd like to replace it with "'" or get rid of it altogether... 
&lt;BR /&gt;There are a number of similar characters but I thought that if I get rid of the one above first then I could use the same approach for the rest 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;Cheers!</description>
    <pubDate>Wed, 29 Feb 2012 08:57:31 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2012-02-29T08:57:31Z</dc:date>
    <item>
      <title>!!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341572#M109514</link>
      <description>Hi! 
&lt;BR /&gt;I'm running Talend 5.0.1 and I'm fighting hard to get rid of, or replace, control characters... This is what I have: 
&lt;BR /&gt;1. tMysqlInput reading from a MySQL database with an utf8_general_ci encoding where some of the characters appear as an "em symbol" in the query output 
&lt;BR /&gt;2. tReplace where I'm trying to replace "\u0025" with an emty string "" 
&lt;BR /&gt;3. tMap component 
&lt;BR /&gt;4. tAdvancedFileOutput where the output encoding is set to UTF8 
&lt;BR /&gt;I thought I'd remove the problem by enclose the text with "&amp;lt;!]&amp;gt;" but it didn't help 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MPcz.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/157233iD1A564EF62DE3BC2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MPcz.png" alt="0683p000009MPcz.png" /&gt;&lt;/span&gt; - Furthermore the tReplace component seems to be unable to replace the single "em" character by looking for "\u0025". If I don't enclose the text with the CDATA directive I get written to the file which causes problems when I try to index the XML in another system... 
&lt;BR /&gt;Hope you're able to help me here because I'm 100% stuck with this... 
&lt;BR /&gt;Many thanks!</description>
      <pubDate>Sat, 16 Nov 2024 12:20:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341572#M109514</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T12:20:38Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341573#M109515</link>
      <description>Hi&lt;BR /&gt;Now let's simplify this issue.&lt;BR /&gt;Show me details, such as Input Data, Expected Data.&lt;BR /&gt;Regards,&lt;BR /&gt;Pedro</description>
      <pubDate>Wed, 29 Feb 2012 08:50:02 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341573#M109515</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-02-29T08:50:02Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341574#M109516</link>
      <description>Hi Pedro! 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;This is what it looks like: 
&lt;BR /&gt;Part of the data from MySQL: "... continued its mission..." - I believe the strange character getS encoded to "" when using a tAdvancedFileOutput component. I'd like to replace it with "'" or get rid of it altogether... 
&lt;BR /&gt;There are a number of similar characters but I thought that if I get rid of the one above first then I could use the same approach for the rest 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;Cheers!</description>
      <pubDate>Wed, 29 Feb 2012 08:57:31 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341574#M109516</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-02-29T08:57:31Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341575#M109517</link>
      <description>Hi&lt;BR /&gt;Set tRaplace as the following image.&lt;BR /&gt;Don't check "Whole word".&lt;BR /&gt;Or I misunderstood what you mean?&lt;BR /&gt;Regards,&lt;BR /&gt;Pedro</description>
      <pubDate>Wed, 29 Feb 2012 09:07:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341575#M109517</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-02-29T09:07:07Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341576#M109518</link>
      <description>Hi Pedro! 
&lt;BR /&gt;Thnaks for your suggestion. I do have a question though: the character I wanted to remove/replace was not the "..." but the single character that looked "strange" in my posting. The "..." was included to show that there were leading and trailing text 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;Cheers!</description>
      <pubDate>Wed, 29 Feb 2012 09:27:31 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341576#M109518</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-02-29T09:27:31Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341577#M109519</link>
      <description>Hi&lt;BR /&gt;Got it this time.&lt;BR /&gt;Do as the image above.&lt;BR /&gt;I' m not sure whether tReplace can replace special character.&lt;BR /&gt;But you can try it.&lt;BR /&gt;The only thing is that don't check "Whole word".&lt;BR /&gt;Regards,&lt;BR /&gt;Pedro</description>
      <pubDate>Wed, 29 Feb 2012 09:35:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341577#M109519</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-02-29T09:35:14Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341578#M109520</link>
      <description>Uh uh - this sounds a bit troublesome. Do you think I'm able to do something like this instead:&lt;BR /&gt;row2.Summary.replace("\u0025", "")&lt;BR /&gt;otherwise specify "\u0025" while using tReplace?&lt;BR /&gt;Cheers!</description>
      <pubDate>Wed, 29 Feb 2012 10:04:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341578#M109520</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-02-29T10:04:27Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341579#M109521</link>
      <description>Hi 
&lt;BR /&gt;This issue is the same with Euro Symbol. 
&lt;BR /&gt;I tried to replace ? with "". 
&lt;BR /&gt;But unfortunately it doesn't work. I reported on BugTracker several days before. 
&lt;BR /&gt;You may try it with Java method. 
&lt;BR /&gt;Regards, 
&lt;BR /&gt;Pedro</description>
      <pubDate>Wed, 29 Feb 2012 10:13:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341579#M109521</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-02-29T10:13:14Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341580#M109522</link>
      <description>Hi Pedro!&lt;BR /&gt;Many thanks for your help. I hope this get solved soon. In the meantime I need to find another solution...&lt;BR /&gt;Cheers</description>
      <pubDate>Wed, 29 Feb 2012 10:15:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341580#M109522</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-02-29T10:15:07Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341581#M109523</link>
      <description>Is the data actually UTF8? If so it would display properly.</description>
      <pubDate>Wed, 29 Feb 2012 11:10:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341581#M109523</guid>
      <dc:creator>janhess</dc:creator>
      <dc:date>2012-02-29T11:10:24Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341582#M109524</link>
      <description>Hmmmm, the characters I'm trying to remove are \u0019, \u0025, \u0028 and \u0029 and they're shown as "strange single character" characters 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;I checked in the original file and it is UTF8-encoded... 
&lt;BR /&gt;Cheers</description>
      <pubDate>Wed, 29 Feb 2012 11:27:34 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341582#M109524</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-02-29T11:27:34Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341583#M109525</link>
      <description>Hi!&lt;BR /&gt;I managed to fix a workaround. This is what I did:&lt;BR /&gt;Using a tMap component, I invoked the 'replaceAll' method on the column causing the problem: &amp;lt;column&amp;gt;.replaceAll("","")&lt;BR /&gt;I hope that it will become possible to use a similar approach using the tReplace component in the future.&lt;BR /&gt;Cheers!</description>
      <pubDate>Thu, 01 Mar 2012 08:19:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341583#M109525</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-03-01T08:19:15Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341584#M109526</link>
      <description>Hi,
&lt;BR /&gt;Couldn't you use the tReplace component with some regular expression that allows standard characters only? I'm not that good with regex, but somthing like allows only alphanumeric characters including spaces.
&lt;BR /&gt;Hope this helps.
&lt;BR /&gt;Regards,
&lt;BR /&gt;Arno</description>
      <pubDate>Thu, 01 Mar 2012 08:43:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341584#M109526</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-03-01T08:43:16Z</dc:date>
    </item>
    <item>
      <title>Re: !!! Fighting hard with control characters !!!</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341585#M109527</link>
      <description>@avdbrink: 
&lt;BR /&gt;hmmm, not sure - I tried to replace something like "\u000c" for example but never got it to work with tReplace... it could be that I provided the parameters wrongly but... 
&lt;BR /&gt;The "workaround" I applied works fine soo I'll stick with that for the time being. Thanks for your suggestion though 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MACn.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154443iC5B8CACEF3D12C6A/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MACn.png" alt="0683p000009MACn.png" /&gt;&lt;/span&gt; 
&lt;BR /&gt;Cheers</description>
      <pubDate>Thu, 01 Mar 2012 08:48:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Fighting-hard-with-control-characters/m-p/2341585#M109527</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-03-01T08:48:58Z</dc:date>
    </item>
  </channel>
</rss>

