<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Recovery on failed job in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205669#M6033</link>
    <description>Checkpoints can certainly help with job recovery. However, there is a potential hidden issue when pulling data from relational databases. Relational databases in general, and SQL by definition, are non-deterministic regarding the sequence in which rows are returned. 
&lt;BR /&gt;For example, if data has been added to the source between runs, then the new data may occur in the middle of the data that has already been written in the previous run. The job has to consider the possibility of new data occurring in the middle of data that has already been written. An order by clause can reduce the issue, but may or may not be sufficient. An order by clause can also affect performance of the job. There are ways to reduce this problem, by including things like an inserted data, or last updated date in the where clause and in the order by clause. 
&lt;BR /&gt;Consider checkpoints. Also consider using a flush-and-fill strategy for your job. If flush-and-fill is not possible, consider using something like a last-updated date in your where clause and order by clause to reduce the data issues.</description>
    <pubDate>Fri, 08 Nov 2013 15:09:33 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2013-11-08T15:09:33Z</dc:date>
    <item>
      <title>Recovery on failed job</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205665#M6029</link>
      <description>Hi, 
&lt;BR /&gt;Could anybody tell me is there any option available in Talend Open Studio Data integration to read the record from where it has stopped? 
&lt;BR /&gt;Suppose I have 10 million records in source. I started the job execution and after some time (may be after 1 hr) due to some issue (network etc.), my job failed. When the job failed, already 1 million record transferred to target. What will happen to the record if i re-run the job again. Is the job going to read data from the begining and target data will be truncated or will it read from the position where it stopped and data will be appended in target? 
&lt;BR /&gt; 
&lt;BR /&gt;Thanks in advance. 
&lt;BR /&gt;Best regards, 
&lt;BR /&gt;Sisir</description>
      <pubDate>Thu, 07 Nov 2013 08:47:28 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205665#M6029</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2013-11-07T08:47:28Z</dc:date>
    </item>
    <item>
      <title>Re: Recovery on failed job</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205666#M6030</link>
      <description>The job will read data from the beginning.&lt;BR /&gt;Target data may be truncated or may be appended depending on how you have the target component configured.  Please let us know which component you are using for a target, and we can better explain how to configure the component for your needs.</description>
      <pubDate>Thu, 07 Nov 2013 16:57:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205666#M6030</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-11-07T16:57:38Z</dc:date>
    </item>
    <item>
      <title>Re: Recovery on failed job</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205667#M6031</link>
      <description>Hi Wayne, 
&lt;BR /&gt;Thanks for your reply. 
&lt;BR /&gt;Could you please tell me is there any option available in Talend that reads data from the particular point where it stopped. 
&lt;BR /&gt;I set commit interval for target component 10000. I don't want to load previous records in my target. If the job gets failed due to any issue(say network or anything else), then for next run, it has to read the data from where it left. It should not bring again the previous records which are already been committed in target. Let's say I am moving data from one Oracle table to another Oracle table. 
&lt;BR /&gt; 
&lt;BR /&gt;Best regards, 
&lt;BR /&gt;Sisir</description>
      <pubDate>Fri, 08 Nov 2013 09:37:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205667#M6031</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-11-08T09:37:58Z</dc:date>
    </item>
    <item>
      <title>Re: Recovery on failed job</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205668#M6032</link>
      <description>There are checkpoint that can help recover from job execution failure: &lt;A href="https://help.talend.com/search/all?query=How+to+recover+Job+execution+in+case+of+failure&amp;amp;content-lang=en" rel="nofollow noopener noreferrer"&gt;https://help.talend.com/search/all?query=How+to+recover+Job+execution+in+case+of+failure&amp;amp;content-lang=en&lt;/A&gt;&lt;BR /&gt;but this is only available in the subscription versions of Talend.</description>
      <pubDate>Fri, 08 Nov 2013 14:09:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205668#M6032</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-11-08T14:09:44Z</dc:date>
    </item>
    <item>
      <title>Re: Recovery on failed job</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205669#M6033</link>
      <description>Checkpoints can certainly help with job recovery. However, there is a potential hidden issue when pulling data from relational databases. Relational databases in general, and SQL by definition, are non-deterministic regarding the sequence in which rows are returned. 
&lt;BR /&gt;For example, if data has been added to the source between runs, then the new data may occur in the middle of the data that has already been written in the previous run. The job has to consider the possibility of new data occurring in the middle of data that has already been written. An order by clause can reduce the issue, but may or may not be sufficient. An order by clause can also affect performance of the job. There are ways to reduce this problem, by including things like an inserted data, or last updated date in the where clause and in the order by clause. 
&lt;BR /&gt;Consider checkpoints. Also consider using a flush-and-fill strategy for your job. If flush-and-fill is not possible, consider using something like a last-updated date in your where clause and order by clause to reduce the data issues.</description>
      <pubDate>Fri, 08 Nov 2013 15:09:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205669#M6033</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-11-08T15:09:33Z</dc:date>
    </item>
    <item>
      <title>Re: Recovery on failed job</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205670#M6034</link>
      <description>Thanks Wayne.&lt;BR /&gt;I will try to implement the tips you provided. Post the result once I get significant outcome.&lt;BR /&gt;&lt;BR /&gt;Best regards,&lt;BR /&gt;Sisir</description>
      <pubDate>Thu, 14 Nov 2013 05:25:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205670#M6034</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-11-14T05:25:56Z</dc:date>
    </item>
    <item>
      <title>Re: Recovery on failed job</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205671#M6035</link>
      <description>Hi, 
&lt;BR /&gt;how to set a commit interval on target?. where can i find the option to enter the number . for example i want to set commit interval of 10000. where can i do it. is that option available in open studio?</description>
      <pubDate>Fri, 27 Jun 2014 08:20:31 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205671#M6035</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-27T08:20:31Z</dc:date>
    </item>
    <item>
      <title>Re: Recovery on failed job</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205672#M6036</link>
      <description>&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;Hi, &lt;BR /&gt;how to set a commit interval on target?. where can i find the option to enter the number . for example i want to set commit interval of 10000. where can i do it. is that option available in open studio?&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;The commit internal option is usually in the advanced settings tab of database output component, such as tMysqlOutput. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Shong</description>
      <pubDate>Fri, 27 Jun 2014 08:54:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Recovery-on-failed-job/m-p/2205672#M6036</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2014-06-27T08:54:27Z</dc:date>
    </item>
  </channel>
</rss>

