<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: tAddCRCRow returns duplicate values for different inputs in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/tAddCRCRow-returns-duplicate-values-for-different-inputs/m-p/2548244#M149551</link>
    <description>&lt;P&gt;Hello RahulReliable,&lt;BR /&gt;&lt;BR /&gt;Thank you for reaching out to the &lt;STRONG&gt;Qlik community&lt;/STRONG&gt;.&lt;BR /&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://talendskill.com/talend-for-esb-docs/docs-7-x/taddcrcrow-talend-open-studio-for-esb-document-7-x/" target="_blank"&gt;tAddCRCRow &lt;/A&gt;does &lt;STRONG&gt;not guarantee unique values&lt;/STRONG&gt;. It computes a &lt;STRONG&gt;CRC (hash) from the selected columns&lt;/STRONG&gt;, and &lt;STRONG&gt;different inputs can legitimately produce the same CRC&lt;/STRONG&gt; (called a &lt;I&gt;collision&lt;/I&gt;), especially if:&lt;/P&gt;
&lt;UL&gt;
 &lt;LI&gt;You selected &lt;STRONG&gt;only one column&lt;/STRONG&gt; or too few columns,&lt;/LI&gt;
 &lt;LI&gt;you are using a &lt;STRONG&gt;short CRC type&lt;/STRONG&gt; (CRC8/CRC16),&lt;/LI&gt;
 &lt;LI&gt;values differ only by &lt;STRONG&gt;spaces, case, or trimming rules&lt;/STRONG&gt;, or&lt;/LI&gt;
 &lt;LI&gt;The data size is large.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;So, &lt;STRONG&gt;different column values do NOT always mean different CRC values&lt;/STRONG&gt; by design.&lt;BR /&gt;&lt;BR /&gt;To deal with this, we recommend the following.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
 &lt;LI&gt;&lt;STRONG&gt;Select all business key columns&lt;/STRONG&gt; that define uniqueness (not just one). &lt;A href="https://help.qlik.com/talend/en-US/components/8.0/identification/taddcrcrow-tfileinputdelimited-tlogrow-tlogrow-configuring-taddcrcrow-component-standard-component-in" target="_blank"&gt;document&lt;/A&gt;.&lt;/LI&gt;
 &lt;LI&gt;Use &lt;A href="https://community.qlik.com/t5/Talend-Studio/tADDCRCRow-component-duplicate-CRC-issue/td-p/2309903" target="_blank"&gt;&lt;STRONG&gt;CRC32&lt;/STRONG&gt;&lt;/A&gt; (longer CRC → fewer collisions).&amp;nbsp;&lt;/LI&gt;
 &lt;LI&gt;Normalize data first (TRIM, UPPER/LOWER) before tAddCRCRow. &lt;A href="https://community.qlik.com/t5/Talend-Studio/Talend-4-2-3-taddCRCrow-same-CRC-value-for-2-different-data-set/td-p/2329442" target="_blank"&gt;document&lt;/A&gt;.&lt;/LI&gt;
 &lt;LI&gt;&lt;STRONG&gt;Do NOT rely on CRC as a guaranteed join key&lt;/STRONG&gt;—use it only as a &lt;I&gt;helper checksum&lt;/I&gt;.&lt;/LI&gt;
&lt;/UL&gt;</description>
    <pubDate>Fri, 01 May 2026 10:48:59 GMT</pubDate>
    <dc:creator>Rahul_Kale</dc:creator>
    <dc:date>2026-05-01T10:48:59Z</dc:date>
    <item>
      <title>tAddCRCRow returns duplicate values for different inputs</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tAddCRCRow-returns-duplicate-values-for-different-inputs/m-p/2548163#M149549</link>
      <description>&lt;P&gt;Background: In talend Data Management studio, We are loading data to multiple tables and in each table we are using taddCRCRow for one of our common column.&amp;nbsp;&lt;BR /&gt;later we are joining these tables based on this column in database as per our client's requirement.&lt;BR /&gt;&lt;BR /&gt;issue : in a table , we have different values to our columns. when we are connecting it to tAddCRCRow,&amp;nbsp; we are getting same results for two columns three times.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="RahulReliable_0-1777537512604.png" style="width: 400px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/188079i35CA79D104C19FAF/image-size/medium?v=v2&amp;amp;px=400" role="button" title="RahulReliable_0-1777537512604.png" alt="RahulReliable_0-1777537512604.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="RahulReliable_1-1777537562278.png" style="width: 400px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/188080iCE4C4B8DE60C818F/image-size/medium?v=v2&amp;amp;px=400" role="button" title="RahulReliable_1-1777537562278.png" alt="RahulReliable_1-1777537562278.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If my column have different values , it should return different CRC value as well ?&lt;BR /&gt;please advice.&lt;/P&gt;</description>
      <pubDate>Thu, 30 Apr 2026 08:28:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tAddCRCRow-returns-duplicate-values-for-different-inputs/m-p/2548163#M149549</guid>
      <dc:creator>RahulReliable</dc:creator>
      <dc:date>2026-04-30T08:28:04Z</dc:date>
    </item>
    <item>
      <title>Re: tAddCRCRow returns duplicate values for different inputs</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tAddCRCRow-returns-duplicate-values-for-different-inputs/m-p/2548244#M149551</link>
      <description>&lt;P&gt;Hello RahulReliable,&lt;BR /&gt;&lt;BR /&gt;Thank you for reaching out to the &lt;STRONG&gt;Qlik community&lt;/STRONG&gt;.&lt;BR /&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://talendskill.com/talend-for-esb-docs/docs-7-x/taddcrcrow-talend-open-studio-for-esb-document-7-x/" target="_blank"&gt;tAddCRCRow &lt;/A&gt;does &lt;STRONG&gt;not guarantee unique values&lt;/STRONG&gt;. It computes a &lt;STRONG&gt;CRC (hash) from the selected columns&lt;/STRONG&gt;, and &lt;STRONG&gt;different inputs can legitimately produce the same CRC&lt;/STRONG&gt; (called a &lt;I&gt;collision&lt;/I&gt;), especially if:&lt;/P&gt;
&lt;UL&gt;
 &lt;LI&gt;You selected &lt;STRONG&gt;only one column&lt;/STRONG&gt; or too few columns,&lt;/LI&gt;
 &lt;LI&gt;you are using a &lt;STRONG&gt;short CRC type&lt;/STRONG&gt; (CRC8/CRC16),&lt;/LI&gt;
 &lt;LI&gt;values differ only by &lt;STRONG&gt;spaces, case, or trimming rules&lt;/STRONG&gt;, or&lt;/LI&gt;
 &lt;LI&gt;The data size is large.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;So, &lt;STRONG&gt;different column values do NOT always mean different CRC values&lt;/STRONG&gt; by design.&lt;BR /&gt;&lt;BR /&gt;To deal with this, we recommend the following.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL&gt;
 &lt;LI&gt;&lt;STRONG&gt;Select all business key columns&lt;/STRONG&gt; that define uniqueness (not just one). &lt;A href="https://help.qlik.com/talend/en-US/components/8.0/identification/taddcrcrow-tfileinputdelimited-tlogrow-tlogrow-configuring-taddcrcrow-component-standard-component-in" target="_blank"&gt;document&lt;/A&gt;.&lt;/LI&gt;
 &lt;LI&gt;Use &lt;A href="https://community.qlik.com/t5/Talend-Studio/tADDCRCRow-component-duplicate-CRC-issue/td-p/2309903" target="_blank"&gt;&lt;STRONG&gt;CRC32&lt;/STRONG&gt;&lt;/A&gt; (longer CRC → fewer collisions).&amp;nbsp;&lt;/LI&gt;
 &lt;LI&gt;Normalize data first (TRIM, UPPER/LOWER) before tAddCRCRow. &lt;A href="https://community.qlik.com/t5/Talend-Studio/Talend-4-2-3-taddCRCrow-same-CRC-value-for-2-different-data-set/td-p/2329442" target="_blank"&gt;document&lt;/A&gt;.&lt;/LI&gt;
 &lt;LI&gt;&lt;STRONG&gt;Do NOT rely on CRC as a guaranteed join key&lt;/STRONG&gt;—use it only as a &lt;I&gt;helper checksum&lt;/I&gt;.&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Fri, 01 May 2026 10:48:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tAddCRCRow-returns-duplicate-values-for-different-inputs/m-p/2548244#M149551</guid>
      <dc:creator>Rahul_Kale</dc:creator>
      <dc:date>2026-05-01T10:48:59Z</dc:date>
    </item>
  </channel>
</rss>

