<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Problem with "ellipse" character and treplace component in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Problem-with-quot-ellipse-quot-character-and-treplace-component/m-p/2277506#M53267</link>
    <description>&lt;P&gt;I think this might be a bug or limitation.&amp;nbsp; If you have a text file with the following line:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;LF05.03|91|For nooks…||&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The delimiter is the pipe "|". Note that the line contains the Windows-1252 ellipsis character "…".&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When you use tReplace with the pattern "^(LF05.+?)\\|$" and the substitution "$1", it should strip off the trailing pipe character, but it does not work when the ellipsis character is in the file. It does work if you take the ellipsis out.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I'm doing some cleanup on a big file, and tReplace fails when specific characters, such as the ellipsis, are present.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 08 Jan 2019 18:33:06 GMT</pubDate>
    <dc:creator>billimmer</dc:creator>
    <dc:date>2019-01-08T18:33:06Z</dc:date>
    <item>
      <title>Problem with "ellipse" character and treplace component</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problem-with-quot-ellipse-quot-character-and-treplace-component/m-p/2277506#M53267</link>
      <description>&lt;P&gt;I think this might be a bug or limitation.&amp;nbsp; If you have a text file with the following line:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;LF05.03|91|For nooks…||&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The delimiter is the pipe "|". Note that the line contains the Windows-1252 ellipsis character "…".&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When you use tReplace with the pattern "^(LF05.+?)\\|$" and the substitution "$1", it should strip off the trailing pipe character, but it does not work when the ellipsis character is in the file. It does work if you take the ellipsis out.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I'm doing some cleanup on a big file, and tReplace fails when specific characters, such as the ellipsis, are present.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 08 Jan 2019 18:33:06 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problem-with-quot-ellipse-quot-character-and-treplace-component/m-p/2277506#M53267</guid>
      <dc:creator>billimmer</dc:creator>
      <dc:date>2019-01-08T18:33:06Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with "ellipse" character and treplace component</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Problem-with-quot-ellipse-quot-character-and-treplace-component/m-p/2277507#M53268</link>
      <description>&lt;P&gt;This modified expression will work:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;^(LF05\\.[^\\n]*)\\|$&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
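&lt;P&gt;The difference between the original pattern and the expression above can be reproduced in plain Java. This is only a minimal sketch, assuming the file is decoded as ISO-8859-1, so that the Windows-1252 ellipsis byte 0x85 becomes U+0085 (NEXT LINE), which Java regex treats as a line terminator that "." does not match by default. That decoding is an assumption and is not confirmed anywhere in this thread. The class and method names are hypothetical, and String.replaceAll stands in for whatever tReplace does internally.&lt;/P&gt;

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EllipsisRegexDemo {
    // Original pattern from the question: "." stops at Java line terminators.
    static final String ORIGINAL = "^(LF05.+?)\\|$";
    // Modified pattern from this reply: the negated class excludes only \n.
    static final String MODIFIED = "^(LF05\\.[^\\n]*)\\|$";

    // Builds the sample line as raw bytes, with the ellipsis stored as the
    // single Windows-1252 byte 0x85.
    static byte[] sampleLine() {
        byte[] head = "LF05.03|91|For nooks".getBytes(StandardCharsets.US_ASCII);
        byte[] tail = "||".getBytes(StandardCharsets.US_ASCII);
        byte[] raw = new byte[head.length + 1 + tail.length];
        System.arraycopy(head, 0, raw, 0, head.length);
        raw[head.length] = (byte) 0x85;
        System.arraycopy(tail, 0, raw, head.length + 1, tail.length);
        return raw;
    }

    // Decodes the bytes with the given charset, then applies the replacement.
    static String stripTrailingPipe(byte[] raw, Charset charset, String pattern) {
        return new String(raw, charset).replaceAll(pattern, "$1");
    }

    // True when the replacement removed one of the two trailing pipes.
    static boolean stripped(byte[] raw, Charset charset, String pattern) {
        return !stripTrailingPipe(raw, charset, pattern).endsWith("||");
    }

    public static void main(String[] args) {
        byte[] raw = sampleLine();
        Charset latin1 = StandardCharsets.ISO_8859_1; // assumed mis-decoding
        Charset cp1252 = Charset.forName("windows-1252");

        System.out.println("ISO-8859-1, original pattern: " + stripped(raw, latin1, ORIGINAL));
        System.out.println("ISO-8859-1, modified pattern: " + stripped(raw, latin1, MODIFIED));
        System.out.println("windows-1252, original pattern: " + stripped(raw, cp1252, ORIGINAL));
    }
}
```

&lt;P&gt;If the assumption holds, only the first combination leaves both trailing pipes in place, which suggests that setting the input component's encoding to windows-1252 may be an alternative to changing the pattern.&lt;/P&gt;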
&lt;P&gt;Note that the problem is that the regex dot "." does not match the ellipsis character as it is read from the file. This is something to be aware of if your input file uses Windows-1252 encoding.&lt;/P&gt;</description>
      <pubDate>Tue, 08 Jan 2019 20:30:35 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Problem-with-quot-ellipse-quot-character-and-treplace-component/m-p/2277507#M53268</guid>
      <dc:creator>billimmer</dc:creator>
      <dc:date>2019-01-08T20:30:35Z</dc:date>
    </item>
  </channel>
</rss>

