<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Talend Open Studio tMysqlSCD - special characters cause new version always in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Talend-Open-Studio-tMysqlSCD-special-characters-cause-new/m-p/2283425#M57347</link>
    <description>&lt;P&gt;I'm having an issue with the tMysqlSCD component.&amp;nbsp; I ran a load, and then ran another load without changing the input data.&amp;nbsp; So my expectation would be that there would be no new Type2 SCD records generated as nothing changed.&amp;nbsp; Surprise! I have some new SCD versions!&amp;nbsp; What?&amp;nbsp; HOW?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It appears that if the column being checked for changes contains special characters (in this case linefeed LF) there is some flaw in the SCD type 2 detection that will always create a new version.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is a huge pain as a lot of our text fields can have special characters.&amp;nbsp; Any way to get around this?&amp;nbsp; I'm on 6.5.1 should that matter.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;thanks, Bryan&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 11 Jan 2019 23:44:43 GMT</pubDate>
    <dc:creator>bglaplante</dc:creator>
    <dc:date>2019-01-11T23:44:43Z</dc:date>
    <item>
      <title>Talend Open Studio tMysqlSCD - special characters cause new version always</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-Open-Studio-tMysqlSCD-special-characters-cause-new/m-p/2283425#M57347</link>
      <description>&lt;P&gt;I'm having an issue with the tMysqlSCD component.&amp;nbsp; I ran a load, and then ran another load without changing the input data.&amp;nbsp; So my expectation would be that there would be no new Type2 SCD records generated as nothing changed.&amp;nbsp; Surprise! I have some new SCD versions!&amp;nbsp; What?&amp;nbsp; HOW?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It appears that if the column being checked for changes contains special characters (in this case linefeed LF) there is some flaw in the SCD type 2 detection that will always create a new version.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is a huge pain as a lot of our text fields can have special characters.&amp;nbsp; Any way to get around this?&amp;nbsp; I'm on 6.5.1 should that matter.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;thanks, Bryan&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Jan 2019 23:44:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-Open-Studio-tMysqlSCD-special-characters-cause-new/m-p/2283425#M57347</guid>
      <dc:creator>bglaplante</dc:creator>
      <dc:date>2019-01-11T23:44:43Z</dc:date>
    </item>
    <item>
      <title>Re: Talend Open Studio tMysqlSCD - special characters cause new version always</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-Open-Studio-tMysqlSCD-special-characters-cause-new/m-p/2283426#M57348</link>
      <description>&lt;P&gt;Actually - a correction.&amp;nbsp; It isn't a LF character causing the issue (as trim() doesn't remove it).&amp;nbsp; A dump of an offending string finds extended ASCII characters, in this case characters 0xC2 and 0xAC.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Looking at the source system the data is "ul Podbipięty 27/1"&amp;nbsp; Note the special ę&amp;nbsp; character.&amp;nbsp; I think this is causing the issue.&amp;nbsp; The target table is MySql utf8mb4 --- but in MySQL workbench the character no longer looks like ę but instead is showing as a ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;My hunch here is that somehow the extended character is being modified by the time it lands in MySQL, and then the SCD is finding it to be 'different' (well it is), and then tries to do a type 2 update, which just repeats the problem.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 18:39:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-Open-Studio-tMysqlSCD-special-characters-cause-new/m-p/2283426#M57348</guid>
      <dc:creator>bglaplante</dc:creator>
      <dc:date>2019-01-14T18:39:38Z</dc:date>
    </item>
    <item>
      <title>Re: Talend Open Studio tMysqlSCD - special characters cause new version always</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Talend-Open-Studio-tMysqlSCD-special-characters-cause-new/m-p/2283427#M57349</link>
      <description>&lt;P&gt;Solution:&lt;/P&gt;
&lt;P&gt;in MySqlConnection, additional jdbc parameters, add utf8, for example:&lt;/P&gt;
&lt;P&gt;noDatetimeStringSync=true&amp;amp;characterEncoding=utf8&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Then the UTF characters are loaded to MySQL properly, and thus the SCD doesn't see differences.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 21:26:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Talend-Open-Studio-tMysqlSCD-special-characters-cause-new/m-p/2283427#M57349</guid>
      <dc:creator>bglaplante</dc:creator>
      <dc:date>2019-01-14T21:26:07Z</dc:date>
    </item>
  </channel>
</rss>

