<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to handle increment update in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487241#M141405</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;Please have a look at CDC feature, introduced in Qlik &lt;SPAN class="keyword"&gt;Talend Studio&lt;/SPAN&gt; which quickly identifies and captures data that has been added to, updated in, or removed from database tables and makes this change data available for future use by applications or individuals. The CDC feature is available for Oracle, MySQL, DB2, PostgreSQL, Sybase, MS SQL Server, Informix, Ingres, Teradata, and AS/400.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://help.qlik.com/talend/en-US/studio-user-guide/8.0-R2024-09/studio-user-guide/change-data-capture-cdc" target="_blank"&gt;https://help.qlik.com/talend/en-US/studio-user-guide/8.0-R2024-09/studio-user-guide/change-data-capture-cdc&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Best regards&lt;/P&gt;
&lt;P&gt;Sabrina&lt;/P&gt;</description>
    <pubDate>Wed, 16 Oct 2024 02:33:29 GMT</pubDate>
    <dc:creator>Xiaodi_Shi</dc:creator>
    <dc:date>2024-10-16T02:33:29Z</dc:date>
    <item>
      <title>How to handle increment update</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487144#M141401</link>
      <description>&lt;P&gt;We're trying to sync records from multiple data sources to one place, and we try to detect the duplicate records and let the end user select the master record from the duplicate group. Here my approach:&lt;/P&gt;
&lt;P&gt;1. Get all of the records from the data sources and use the tUnite to merge the records and pass to tMatchGroup.&lt;/P&gt;
&lt;P&gt;2. In the tMatchGroup group the duplicate records and then pass the group records to Data Stewardship to let the user detect the master record.&lt;/P&gt;
&lt;P&gt;It works for one time sync. But if any data source has record (s) created or updated, we still need to transfer to the end data source. We need to do a duplication check for the new record (s) as well.&lt;/P&gt;
&lt;P&gt;With step #2, it will generate duplication group for all of the records again (include old records), any way to only detect the new record (s) duplication group? Or any other good approach for it?&lt;/P&gt;</description>
      <pubDate>Tue, 15 Oct 2024 16:19:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487144#M141401</guid>
      <dc:creator>BenjaminLiu</dc:creator>
      <dc:date>2024-10-15T16:19:41Z</dc:date>
    </item>
    <item>
      <title>Re: How to handle increment update</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487241#M141405</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;Please have a look at CDC feature, introduced in Qlik &lt;SPAN class="keyword"&gt;Talend Studio&lt;/SPAN&gt; which quickly identifies and captures data that has been added to, updated in, or removed from database tables and makes this change data available for future use by applications or individuals. The CDC feature is available for Oracle, MySQL, DB2, PostgreSQL, Sybase, MS SQL Server, Informix, Ingres, Teradata, and AS/400.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://help.qlik.com/talend/en-US/studio-user-guide/8.0-R2024-09/studio-user-guide/change-data-capture-cdc" target="_blank"&gt;https://help.qlik.com/talend/en-US/studio-user-guide/8.0-R2024-09/studio-user-guide/change-data-capture-cdc&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Best regards&lt;/P&gt;
&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Wed, 16 Oct 2024 02:33:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487241#M141405</guid>
      <dc:creator>Xiaodi_Shi</dc:creator>
      <dc:date>2024-10-16T02:33:29Z</dc:date>
    </item>
    <item>
      <title>Re: How to handle increment update</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487243#M141407</link>
      <description>&lt;P&gt;Thanks for the reply! It will help us to catch the new changes.&lt;/P&gt;
&lt;P&gt;But my next issue is how to detect the duplicates for the new changes. Just use the tMatchGroup again to group all of the duplicate records which includes the old records, or any other way to only get the new change records duplications.&lt;/P&gt;
&lt;P&gt;Our case is we always need to check if the current syncing records have the duplicate records with the syncing records and persist records, then let user to manually select the only one master record.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Oct 2024 03:03:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487243#M141407</guid>
      <dc:creator>BenjaminLiu</dc:creator>
      <dc:date>2024-10-16T03:03:40Z</dc:date>
    </item>
    <item>
      <title>Re: How to handle increment update</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487441#M141411</link>
      <description>&lt;P&gt;You need to select for the new records the matching records in your target and provide both of them as new match group.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 16 Oct 2024 15:50:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487441#M141411</guid>
      <dc:creator>jlolling</dc:creator>
      <dc:date>2024-10-16T15:50:05Z</dc:date>
    </item>
    <item>
      <title>Re: How to handle increment update</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487559#M141412</link>
      <description>&lt;P&gt;Ok, thanks! I just curious about the performance.&lt;/P&gt;
&lt;P&gt;We always need to compare the new records with the matching records. From my understanding, it will compare between the matching records again, wanna know any way only check the new records duplication from the whole records? Then the performance should be better.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Oct 2024 06:48:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-handle-increment-update/m-p/2487559#M141412</guid>
      <dc:creator>BenjaminLiu</dc:creator>
      <dc:date>2024-10-17T06:48:00Z</dc:date>
    </item>
  </channel>
</rss>

