<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Loading from file to table in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Loading-from-file-to-table/m-p/2257653#M39688</link>
    <description>&lt;BLOCKQUOTE&gt; 
 &lt;HR /&gt; 
 &lt;A href="https://community.qlik.com/s/profile/0053p000007LPFSAA4"&gt;@ksingh&lt;/A&gt;&amp;nbsp;wrote: 
 &lt;BR /&gt; 
 &lt;P&gt;&lt;STRONG&gt;Always&lt;/STRONG&gt; when loading from file to Db,&lt;STRONG&gt; use a Schemacompliancecheck&lt;/STRONG&gt; component before inserting into DB, &lt;STRONG&gt;use Manual commit every time&lt;/STRONG&gt;. When the records are loading from file to db, we need to increase our memory in the job. before a run try to do memory run.&amp;nbsp;&lt;/P&gt; 
 &lt;P&gt;follow this design&lt;/P&gt; 
 &lt;P&gt;tfileinput =&amp;gt; tmap =&amp;gt; tschemacompliance =&amp;gt; tmssqloutput/Any db&lt;/P&gt; 
 &lt;HR /&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;P&gt;not 100% agree&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;this does not work for huge files, not first, not second&lt;/P&gt; 
&lt;P&gt;it will well work for a relatively small number of rows ... but what about 10M rows?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;if data is dirty - you need to create &lt;STRONG&gt;separate&lt;/STRONG&gt; job for clean them&lt;/P&gt; 
&lt;P&gt;if data is huge - the single transaction will kill your&amp;nbsp;talend and your database server, so you need to commit more often rather than try to commit all at the end&lt;/P&gt;</description>
    <pubDate>Wed, 13 Feb 2019 22:28:33 GMT</pubDate>
    <dc:creator>vapukov</dc:creator>
    <dc:date>2019-02-13T22:28:33Z</dc:date>
    <item>
      <title>Loading from file to table</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-from-file-to-table/m-p/2257650#M39685</link>
      <description>I have file like this file.txt which may contain millions of records. I want to load into target. When I am executing the job it may get error in middle of the record. I resolve the error again I execute the job the data will load from starting to ending ? Or wherever the error is got after the records will load why? 
&lt;BR /&gt;</description>
      <pubDate>Mon, 11 Feb 2019 23:52:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-from-file-to-table/m-p/2257650#M39685</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-11T23:52:40Z</dc:date>
    </item>
    <item>
      <title>Re: Loading from file to table</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-from-file-to-table/m-p/2257651#M39686</link>
      <description>&lt;P&gt;hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;it depend from your job design - does you handle this or not&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;by default - all will continue from begin&lt;/P&gt;
&lt;P&gt;if you target support insert ignore and contain primary key, job could avoid duplicates&lt;/P&gt;
&lt;P&gt;if not, you must manage this manually, as variant - use autocommit, check number of loaded in target, read file from row N+1&lt;/P&gt;</description>
      <pubDate>Tue, 12 Feb 2019 01:14:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-from-file-to-table/m-p/2257651#M39686</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2019-02-12T01:14:24Z</dc:date>
    </item>
    <item>
      <title>Re: Loading from file to table</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-from-file-to-table/m-p/2257652#M39687</link>
      <description>&lt;P&gt;Always when loading from file to Db, use a Schemacompliancecheck component before inserting into DB, use Manual commit every time. When the records are loading from file to db, we need to increase our memory in the job. before a run try to do memory run.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;follow this design&lt;/P&gt;
&lt;P&gt;tfileinput =&amp;gt; tmap =&amp;gt; tschemacompliance =&amp;gt; tmssqloutput/Any db&lt;/P&gt;</description>
      <pubDate>Wed, 13 Feb 2019 21:47:36 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-from-file-to-table/m-p/2257652#M39687</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-13T21:47:36Z</dc:date>
    </item>
    <item>
      <title>Re: Loading from file to table</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Loading-from-file-to-table/m-p/2257653#M39688</link>
      <description>&lt;BLOCKQUOTE&gt; 
 &lt;HR /&gt; 
 &lt;A href="https://community.qlik.com/s/profile/0053p000007LPFSAA4"&gt;@ksingh&lt;/A&gt;&amp;nbsp;wrote: 
 &lt;BR /&gt; 
 &lt;P&gt;&lt;STRONG&gt;Always&lt;/STRONG&gt; when loading from file to Db,&lt;STRONG&gt; use a Schemacompliancecheck&lt;/STRONG&gt; component before inserting into DB, &lt;STRONG&gt;use Manual commit every time&lt;/STRONG&gt;. When the records are loading from file to db, we need to increase our memory in the job. before a run try to do memory run.&amp;nbsp;&lt;/P&gt; 
 &lt;P&gt;follow this design&lt;/P&gt; 
 &lt;P&gt;tfileinput =&amp;gt; tmap =&amp;gt; tschemacompliance =&amp;gt; tmssqloutput/Any db&lt;/P&gt; 
 &lt;HR /&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;P&gt;not 100% agree&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;this does not work for huge files, not first, not second&lt;/P&gt; 
&lt;P&gt;it will well work for a relatively small number of rows ... but what about 10M rows?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;if data is dirty - you need to create &lt;STRONG&gt;separate&lt;/STRONG&gt; job for clean them&lt;/P&gt; 
&lt;P&gt;if data is huge - the single transaction will kill your&amp;nbsp;talend and your database server, so you need to commit more often rather than try to commit all at the end&lt;/P&gt;</description>
      <pubDate>Wed, 13 Feb 2019 22:28:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Loading-from-file-to-table/m-p/2257653#M39688</guid>
      <dc:creator>vapukov</dc:creator>
      <dc:date>2019-02-13T22:28:33Z</dc:date>
    </item>
  </channel>
</rss>

