<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to perform Incremental load from a file. in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306687#M78200</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I have a source file which I receive on daily basis and it has data appended from previous day's data and some updated records. Since it is file there is no key column or any date fields. How can I perform Incremental load using Talend. Is there any component to fetch most recent data from file. Please assist.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks a lot.&lt;/P&gt;</description>
    <pubDate>Thu, 26 Oct 2017 06:17:02 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2017-10-26T06:17:02Z</dc:date>
    <item>
      <title>How to perform Incremental load from a file.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306687#M78200</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I have a source file which I receive on daily basis and it has data appended from previous day's data and some updated records. Since it is file there is no key column or any date fields. How can I perform Incremental load using Talend. Is there any component to fetch most recent data from file. Please assist.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks a lot.&lt;/P&gt;</description>
      <pubDate>Thu, 26 Oct 2017 06:17:02 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306687#M78200</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-10-26T06:17:02Z</dc:date>
    </item>
    <item>
      <title>Re: How to perform Incremental load from a file.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306688#M78201</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;One way is&amp;nbsp;to keep track of the records that are read earlier at the db level. When the job read the file again, it can fetch the value from the db and using that you can&amp;nbsp;change you header value.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;You can keep the header value in the context that can be updated on each run.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Hope that helps !&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Best Regards,&lt;/P&gt; 
&lt;P&gt;Abhishek&lt;/P&gt;</description>
      <pubDate>Thu, 26 Oct 2017 07:04:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306688#M78201</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-10-26T07:04:57Z</dc:date>
    </item>
    <item>
      <title>Re: How to perform Incremental load from a file.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306689#M78202</link>
      <description>&lt;P&gt;Hi Abhishek,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for your reply. Is there any function like MD5 in talend to fetch the updated data and new data from source file. Based on MD5 checksum value will compare with source and target, changes/null in checksum value will treat as new or update.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please let me know if we have any MD5 similar in Talend.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards,&lt;/P&gt;
&lt;P&gt;Venkat.&lt;/P&gt;</description>
      <pubDate>Fri, 27 Oct 2017 09:36:31 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306689#M78202</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-10-27T09:36:31Z</dc:date>
    </item>
    <item>
      <title>Re: How to perform Incremental load from a file.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306690#M78203</link>
      <description>&lt;P&gt;Yes, Talend have a component tAddCRCRow which&amp;nbsp;&lt;SPAN&gt;calculates a surrogate key based on one or several columns and adds it to the defined schema.&lt;/SPAN&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For details you can refer the documentation at the below link -&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://help.talend.com/reader/iYcvdknuprDzYycT3WRU8w/yLVlrTzch~QO2QOqd7KzZg" target="_blank" rel="nofollow noopener noreferrer"&gt;https://help.talend.com/reader/iYcvdknuprDzYycT3WRU8w/yLVlrTzch~QO2QOqd7KzZg&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 27 Oct 2017 10:13:09 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306690#M78203</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-10-27T10:13:09Z</dc:date>
    </item>
    <item>
      <title>Re: How to perform Incremental load from a file.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306691#M78204</link>
      <description>&lt;P&gt;Hi Abhishek,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks for your prompt reply. I am trying to learn and explore more on talend. I tried using&amp;nbsp;&lt;SPAN&gt;tAddCRCRow and initial&amp;nbsp;load it generated a unique numeric value in CRC column for all the records. Next run I have few new records and changes to existing records. for new records it is getting new crc value but for updated once no change. How to compare the previous(first load)&amp;nbsp; target file and current source file to fetch updated records.&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN&gt;Thanks in advance...&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 27 Oct 2017 10:56:36 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306691#M78204</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-10-27T10:56:36Z</dc:date>
    </item>
    <item>
      <title>Re: How to perform Incremental load from a file.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306692#M78205</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;What I know and have faced in our implementation, new CRC do get created whenever there is a change in the value of any column.&lt;/P&gt; 
&lt;P&gt;However we faced the same problem with tAddCRCRow when we were pushing more than 1 million of records. With that many number of records&amp;nbsp; there is a probability, we may get duplicate CRC.&lt;/P&gt; 
&lt;P&gt;So, in that case its not a reliable solution.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Best Regards,&lt;/P&gt; 
&lt;P&gt;Abhishek&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Best Regards&lt;/P&gt;</description>
      <pubDate>Fri, 27 Oct 2017 11:47:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306692#M78205</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-10-27T11:47:50Z</dc:date>
    </item>
    <item>
      <title>Re: How to perform Incremental load from a file.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306693#M78206</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/0053p000007LMHzAAO"&gt;@abhishek&lt;/A&gt;,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;You'll need to store the CRC value for each record somewhere, usually a database table&amp;nbsp;created for this,&amp;nbsp;and then do a lookup and compare them to see if they've changed since the job was last run.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;However, you'll need a unique identifier for each record to store&amp;nbsp;along with the CRC, so you can then look up the previous value, and you've&amp;nbsp;stated&amp;nbsp;that there is no key for the input data.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;If, and only if, you can guarantee that&amp;nbsp;the order of the records in your source file won't change, and there's absolutely no chance of records being deleted, just new ones added to the end of the file, then you could add a new integer field into the flow, probably with a tMap, then use Talend's Sequence function to&amp;nbsp;populate this for each row, and use this&amp;nbsp;as your key when storing and looking up the previous CRC value.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Just out of interest, if your source data has no unique key, how do you intend updating the existing records in your&amp;nbsp;database?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Regards,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Chris&lt;/P&gt;</description>
      <pubDate>Fri, 27 Oct 2017 12:05:03 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306693#M78206</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-10-27T12:05:03Z</dc:date>
    </item>
    <item>
      <title>Re: How to perform Incremental load from a file.</title>
      <link>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306694#M78207</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have read all messages but all the steps mentioned above are for talend studio but is there any step we can do in talend data fabric.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Sasidhar&lt;/P&gt;</description>
      <pubDate>Fri, 23 Sep 2022 08:10:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/How-to-perform-Incremental-load-from-a-file/m-p/2306694#M78207</guid>
      <dc:creator>Tech_wiz</dc:creator>
      <dc:date>2022-09-23T08:10:04Z</dc:date>
    </item>
  </channel>
</rss>

