<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Usage for large data tMap lookup and deleting rejects data in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361890#M126140</link>
    <description>&lt;P&gt;Hi, currently we have this on our Talend. See below:&lt;BR /&gt;&lt;I&gt;source (lookup) table(&lt;STRONG&gt;tMSSQLInput &lt;/STRONG&gt;)&lt;/I&gt;&amp;nbsp;---&amp;gt; insert into&amp;nbsp;staging table (&lt;STRONG&gt;&lt;I&gt;tmssqloutput&lt;/I&gt;)&amp;nbsp;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;on sub job ok --&amp;gt; &lt;/STRONG&gt;Delete from table where data is not in source (lookup table) (&lt;STRONG&gt;&lt;I&gt;tMSSQLRow&lt;/I&gt;&lt;/STRONG&gt;)&lt;BR /&gt;&lt;BR /&gt;Basically we just need to delete data from Destination table if the data is missing from Lookup table. Our look up table can consist of huge amount of data, like 200 million.&amp;nbsp;&lt;BR /&gt;When we tested it on the large amount of data, it took us about more than 1 hour just to delete rejected records for just 1 table.&amp;nbsp;&lt;BR /&gt;What is the right approach for this? Should we use tMap to delete the rejected data?&lt;BR /&gt;&lt;BR /&gt;&lt;I&gt;source large data (tmap lookup table)&lt;/I&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;|&lt;BR /&gt;&lt;I&gt;Destination table(tMSSQLInput )&lt;/I&gt; --- &lt;STRONG&gt;tmap &lt;/STRONG&gt;--- &amp;nbsp;&lt;I&gt;Delete rejects in destination table&lt;/I&gt; (&lt;I&gt;tMSSQLOutput)&lt;/I&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Or is there a component that can do bulk delete?&lt;BR /&gt;Thanks in advance!&lt;/P&gt;</description>
    <pubDate>Tue, 19 Apr 2016 07:32:47 GMT</pubDate>
    <dc:creator>_AnonymousUser</dc:creator>
    <dc:date>2016-04-19T07:32:47Z</dc:date>
    <item>
      <title>Usage for large data tMap lookup and deleting rejects data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361890#M126140</link>
      <description>&lt;P&gt;Hi, currently we have this on our Talend. See below:&lt;BR /&gt;&lt;I&gt;source (lookup) table(&lt;STRONG&gt;tMSSQLInput &lt;/STRONG&gt;)&lt;/I&gt;&amp;nbsp;---&amp;gt; insert into&amp;nbsp;staging table (&lt;STRONG&gt;&lt;I&gt;tmssqloutput&lt;/I&gt;)&amp;nbsp;&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;on sub job ok --&amp;gt; &lt;/STRONG&gt;Delete from table where data is not in source (lookup table) (&lt;STRONG&gt;&lt;I&gt;tMSSQLRow&lt;/I&gt;&lt;/STRONG&gt;)&lt;BR /&gt;&lt;BR /&gt;Basically we just need to delete data from Destination table if the data is missing from Lookup table. Our look up table can consist of huge amount of data, like 200 million.&amp;nbsp;&lt;BR /&gt;When we tested it on the large amount of data, it took us about more than 1 hour just to delete rejected records for just 1 table.&amp;nbsp;&lt;BR /&gt;What is the right approach for this? Should we use tMap to delete the rejected data?&lt;BR /&gt;&lt;BR /&gt;&lt;I&gt;source large data (tmap lookup table)&lt;/I&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;|&lt;BR /&gt;&lt;I&gt;Destination table(tMSSQLInput )&lt;/I&gt; --- &lt;STRONG&gt;tmap &lt;/STRONG&gt;--- &amp;nbsp;&lt;I&gt;Delete rejects in destination table&lt;/I&gt; (&lt;I&gt;tMSSQLOutput)&lt;/I&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Or is there a component that can do bulk delete?&lt;BR /&gt;Thanks in advance!&lt;/P&gt;</description>
      <pubDate>Tue, 19 Apr 2016 07:32:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361890#M126140</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2016-04-19T07:32:47Z</dc:date>
    </item>
    <item>
      <title>Re: Usage for large data tMap lookup and deleting rejects data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361891#M126141</link>
      <description>Hi,
&lt;BR /&gt;Have you tried to
&lt;FONT size="2"&gt;&lt;FONT face="Calibri," sans-serif=""&gt;&amp;nbsp;store the data on disk instead of memory on tMap?&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="2"&gt;&lt;FONT face="Calibri," sans-serif=""&gt;Best regards&lt;/FONT&gt;&lt;/FONT&gt;
&lt;BR /&gt;
&lt;FONT size="2"&gt;&lt;FONT face="Calibri," sans-serif=""&gt;Sabrina&lt;/FONT&gt;&lt;/FONT&gt;</description>
      <pubDate>Tue, 19 Apr 2016 08:40:21 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361891#M126141</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-19T08:40:21Z</dc:date>
    </item>
    <item>
      <title>Re: Usage for large data tMap lookup and deleting rejects data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361892#M126142</link>
      <description>&lt;BLOCKQUOTE&gt;
  xdshi wrote: 
 &lt;BR /&gt;Hi, 
 &lt;BR /&gt;Have you tried to 
 &lt;FONT size="2"&gt;&lt;FONT face="Calibri," sans-serif=""&gt;&amp;nbsp;store the data on disk instead of memory on tMap?&lt;/FONT&gt;&lt;/FONT&gt; 
 &lt;BR /&gt; 
 &lt;FONT size="2"&gt;&lt;FONT face="Calibri," sans-serif=""&gt;Best regards&lt;/FONT&gt;&lt;/FONT&gt; 
 &lt;BR /&gt; 
 &lt;FONT size="2"&gt;&lt;FONT face="Calibri," sans-serif=""&gt;Sabrina&lt;/FONT&gt;&lt;/FONT&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;Yup we have tried it too. We're also having a problem that when we run it on Talend Studio, we can see it's running but when the talend job is called from Web application, it seems that we're stuck on the SELECT statement when we checked on MS SQL Studio Management. Is there an issue with that? They have different behavior.&amp;nbsp;</description>
      <pubDate>Tue, 19 Apr 2016 09:46:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361892#M126142</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2016-04-19T09:46:04Z</dc:date>
    </item>
    <item>
      <title>Re: Usage for large data tMap lookup and deleting rejects data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361893#M126143</link>
      <description>Hi,
&lt;BR /&gt;Is there any error messge printed on console? Could you please post your current job setting screenshots into forum which will be helpful for us to address your issue.
&lt;BR /&gt;Best regards
&lt;BR /&gt;Sabrina</description>
      <pubDate>Tue, 19 Apr 2016 10:53:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361893#M126143</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-19T10:53:48Z</dc:date>
    </item>
    <item>
      <title>Re: Usage for large data tMap lookup and deleting rejects data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361894#M126144</link>
      <description>Unfortunately, I cannot upload files. Actually I have a username here. I tried to login on this site but when redirecting to forum site, it will say "You are not logged in." But when I tried to click on log in, my username will appear. 
&lt;BR /&gt;Anyway, It's just a simple Mssqlinput component to tMap with look up table and it will delete the rejects data. 
&lt;BR /&gt; 
&lt;BR /&gt;look up table -------- 
&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;| 
&lt;BR /&gt;tMssqlInput ----&amp;gt; tMap -----&amp;gt; tMSSQLOutput 
&lt;BR /&gt; 
&lt;BR /&gt;The difference is the look up table is more than 200 million. We're using the store to temp drive in tMap but still no luck. The job that's running is stuck on select and the error will be connection closed. I think because of the large data that's why the connection to the database got stuck, what should we do about it? How do you handle millions of data? 
&lt;BR /&gt; 
&lt;BR /&gt;Thanks!</description>
      <pubDate>Tue, 19 Apr 2016 11:05:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361894#M126144</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2016-04-19T11:05:54Z</dc:date>
    </item>
    <item>
      <title>Re: Usage for large data tMap lookup and deleting rejects data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361895#M126145</link>
      <description>I don't think that it is an issue of whether or not Talend can handle 200m records; but I think 200m is too many rows to have in a look-up.&lt;BR /&gt;I think you need to review your overall architecture and see if there is an alternative approach for identifying rows that should not be loaded.&lt;BR /&gt;If you have no alternative, then there are some cases where it makes sense to push the join down to your database, and this may be one of them.</description>
      <pubDate>Tue, 19 Apr 2016 11:35:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361895#M126145</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-19T11:35:41Z</dc:date>
    </item>
    <item>
      <title>Re: Usage for large data tMap lookup and deleting rejects data</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361896#M126146</link>
      <description>&lt;BLOCKQUOTE&gt;  tal00000 wrote:&lt;BR /&gt;I don't think that it is an issue of whether or not Talend can handle 200m records; but I think 200m is too many rows to have in a look-up.&lt;BR /&gt;I think you need to review your overall architecture and see if there is an alternative approach for identifying rows that should not be loaded.&lt;BR /&gt;If you have no alternative, then there are some cases where it makes sense to push the join down to your database, and this may be one of them.&lt;/BLOCKQUOTE&gt;&lt;BR /&gt;I agree with you. But we have no choice. The look up table is from other database that's why we're thinking to use tMap. &amp;nbsp;</description>
      <pubDate>Tue, 19 Apr 2016 13:21:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Usage-for-large-data-tMap-lookup-and-deleting-rejects-data/m-p/2361896#M126146</guid>
      <dc:creator>_AnonymousUser</dc:creator>
      <dc:date>2016-04-19T13:21:57Z</dc:date>
    </item>
  </channel>
</rss>

