<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Duplicate Processing, Attribute level survivorship, Redshift, Compare Rows in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-Processing-Attribute-level-survivorship-Redshift/m-p/2310783#M81880</link>
    <description>I suggest you first use tAggregateRow&lt;BR /&gt;&lt;BR /&gt;It's offers the possibility to get fist/last value&lt;BR /&gt;</description>
    <pubDate>Thu, 11 Jul 2019 13:02:18 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2019-07-11T13:02:18Z</dc:date>
    <item>
      <title>Duplicate Processing, Attribute level survivorship, Redshift, Compare Rows</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-Processing-Attribute-level-survivorship-Redshift/m-p/2310782#M81879</link>
      <description>&lt;P&gt;Hello Everyone,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I have an upcoming design question.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I will have a mapping that populates a Redshift table with approximately 80 fields. The aim will be to never insert a duplicate. However, there is a need to for FIELD level survivorship to be applied, where the rule will be take the latest data for a particular ID but do not replace if the latest data is null and the old data is populated. This check needs to happen for each field.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;What would be the best method to adopt. Will be inserting approx. 100k rows every few hours. Updates to redshift are not allowed, inserts only.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;The only way I can think of is to push this back to the database via a large cumbersome SQL case statements.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Any help would be much appreciated.&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 05:19:09 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Duplicate-Processing-Attribute-level-survivorship-Redshift/m-p/2310782#M81879</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T05:19:09Z</dc:date>
    </item>
    <item>
      <title>Re: Duplicate Processing, Attribute level survivorship, Redshift, Compare Rows</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-Processing-Attribute-level-survivorship-Redshift/m-p/2310783#M81880</link>
      <description>I suggest you first use tAggregateRow&lt;BR /&gt;&lt;BR /&gt;It's offers the possibility to get fist/last value&lt;BR /&gt;</description>
      <pubDate>Thu, 11 Jul 2019 13:02:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Duplicate-Processing-Attribute-level-survivorship-Redshift/m-p/2310783#M81880</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-07-11T13:02:18Z</dc:date>
    </item>
    <item>
      <title>Re: Duplicate Processing, Attribute level survivorship, Redshift, Compare Rows</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-Processing-Attribute-level-survivorship-Redshift/m-p/2310784#M81881</link>
      <description>Hi,
&lt;BR /&gt;
&lt;BR /&gt;Thats not exactly what I am looking for.
&lt;BR /&gt;
&lt;BR /&gt;Essentially want to compare two rows that share a unique ID, and take the best data forward. If already populated, populate with a the new data unless NULL. If NULL then replace with new data.
&lt;BR /&gt;
&lt;BR /&gt;Any ideas?</description>
      <pubDate>Mon, 15 Jul 2019 15:31:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Duplicate-Processing-Attribute-level-survivorship-Redshift/m-p/2310784#M81881</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-07-15T15:31:33Z</dc:date>
    </item>
    <item>
      <title>Re: Duplicate Processing, Attribute level survivorship, Redshift, Compare Rows</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-Processing-Attribute-level-survivorship-Redshift/m-p/2310785#M81882</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; Please refer whether the below scenario matches your need. You may have to make customization based on your specific use case. But the essence is as specified in the below flow.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://help.talend.com/reader/A3Qm3pq~qSkLMBOtSJbPhg/a4UPALg0m66CapZ9UbCQIw" target="_blank" rel="nofollow noopener noreferrer"&gt;https://help.talend.com/reader/A3Qm3pq~qSkLMBOtSJbPhg/a4UPALg0m66CapZ9UbCQIw&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Warm Regards,&lt;BR /&gt;Nikhil Thampi&lt;/P&gt;
&lt;P&gt;Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 15 Jul 2019 17:46:51 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Duplicate-Processing-Attribute-level-survivorship-Redshift/m-p/2310785#M81882</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-07-15T17:46:51Z</dc:date>
    </item>
  </channel>
</rss>

