<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: TMap and Lookups 20+M records in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/TMap-and-Lookups-20-M-records/m-p/2244721#M30783</link>
    <description>&lt;P&gt;Thanks &lt;A href="https://community.qlik.com/s/profile/0053p000007LKpMAAW"&gt;@fdenis&lt;/A&gt;.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If I use reload for each row - as far as I know this decreases the overall execution time&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 18 Sep 2019 16:12:15 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2019-09-18T16:12:15Z</dc:date>
    <item>
      <title>TMap and Lookups 20+M records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/TMap-and-Lookups-20-M-records/m-p/2244719#M30781</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I was trying to convert Informatica mappings to Talend.&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;STRONG&gt;Following table stats&lt;/STRONG&gt;&lt;/P&gt; 
&lt;P&gt;1. Lookup-1: 28M (PostgreSQL Input with SQL join &amp;amp; Filter)&amp;nbsp; - Cursor Size - 1M&lt;/P&gt; 
&lt;UL&gt; 
 &lt;LI&gt;Store on Disk&lt;/LI&gt; 
 &lt;LI&gt;Load Once&lt;/LI&gt; 
 &lt;LI&gt;First match&lt;/LI&gt; 
&lt;/UL&gt; 
&lt;P&gt;2. Lookup-2: 35M (PostgreSQL Input with SQL Filter)&amp;nbsp;- Cursor Size - 1M&lt;/P&gt; 
&lt;UL&gt; 
 &lt;LI&gt;Store on Disk&lt;/LI&gt; 
 &lt;LI&gt;Load Once&lt;/LI&gt; 
 &lt;LI&gt;First match&lt;/LI&gt; 
&lt;/UL&gt; 
&lt;P&gt;Lookups in Parallel&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;3. Main Table: 27M (PostgreSQL Joins with multiple tables and Date Filters)&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Max Memory Settings I provided was 8GB&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Lookups are running fine, but once it reaches Main Table read it will slowdowns and after 1hr of running the whole process it comes out as &lt;FONT color="#FF0000"&gt;Java Heap Memory error.&lt;/FONT&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Not sure what else I got to look to make this work. And this is just one small Mapping and going fwd down the line got too complex mappings with huge data.&amp;nbsp;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Does parallelization help?&lt;/P&gt; 
&lt;P&gt;Does Multi Thread execution help? If Yes, what buffer unit size should set to?&lt;/P&gt; 
&lt;P&gt;Or Custom Batch processing process every 5M records will help?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Please do advice, Thanks.&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 14 Sep 2019 17:05:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/TMap-and-Lookups-20-M-records/m-p/2244719#M30781</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-09-14T17:05:33Z</dc:date>
    </item>
    <item>
      <title>Re: TMap and Lookups 20+M records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/TMap-and-Lookups-20-M-records/m-p/2244720#M30782</link>
      <description>hi, 
&lt;BR /&gt;Java Heap Memory error is due to allowed memory to java process. 
&lt;BR /&gt;so you may (depend on your job): 
&lt;BR /&gt;- incrase -Xmx param 
&lt;BR /&gt;- split process 
&lt;BR /&gt;- use reload for etch row on tmap 
&lt;BR /&gt;- … 
&lt;BR /&gt;good luck</description>
      <pubDate>Mon, 16 Sep 2019 08:27:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/TMap-and-Lookups-20-M-records/m-p/2244720#M30782</guid>
      <dc:creator>fdenis</dc:creator>
      <dc:date>2019-09-16T08:27:26Z</dc:date>
    </item>
    <item>
      <title>Re: TMap and Lookups 20+M records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/TMap-and-Lookups-20-M-records/m-p/2244721#M30783</link>
      <description>&lt;P&gt;Thanks &lt;A href="https://community.qlik.com/s/profile/0053p000007LKpMAAW"&gt;@fdenis&lt;/A&gt;.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If I use reload for each row - as far as I know this decreases the overall execution time&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Sep 2019 16:12:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/TMap-and-Lookups-20-M-records/m-p/2244721#M30783</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-09-18T16:12:15Z</dc:date>
    </item>
    <item>
      <title>Re: TMap and Lookups 20+M records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/TMap-and-Lookups-20-M-records/m-p/2244722#M30784</link>
      <description>it depend on how is build your job and how is filtred your lookup.</description>
      <pubDate>Thu, 19 Sep 2019 08:18:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/TMap-and-Lookups-20-M-records/m-p/2244722#M30784</guid>
      <dc:creator>fdenis</dc:creator>
      <dc:date>2019-09-19T08:18:39Z</dc:date>
    </item>
  </channel>
</rss>

