<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: million records in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/million-records/m-p/2262343#M42838</link>
    <description>Hi,&lt;BR /&gt;There is no standard answer for the maximum volume of data handled by Talend. It depends on project scale, job design, data source and so on.&lt;BR /&gt;Do you want to use SQL query or tMap?making a join? to&amp;nbsp;handle&amp;nbsp;Lookups?&lt;BR /&gt;&amp;nbsp;Please provide&amp;nbsp;the details. ELT components(all tables are in same DB), bulk execute are better way to load large data in a faster way.&lt;BR /&gt;Best regards&lt;BR /&gt;Sabrina</description>
    <pubDate>Wed, 20 Jan 2016 07:04:57 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2016-01-20T07:04:57Z</dc:date>
    <item>
      <title>million records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/million-records/m-p/2262342#M42837</link>
      <description>Hi,
&lt;BR /&gt;I have few questions
&lt;BR /&gt;Q1 . Can we able to process 60-80 million records in Talend ?
&lt;BR /&gt;Q2. What is the best way on performance wise , if I have 4-6 million records on lookup ?
&lt;BR /&gt;Q3.&amp;nbsp;which is best&amp;nbsp;option to store lookup data, if I have 4-6 million look up records&amp;nbsp;&amp;nbsp;? either file or DB ?
&lt;BR /&gt;Regards.</description>
      <pubDate>Tue, 19 Jan 2016 16:07:55 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/million-records/m-p/2262342#M42837</guid>
      <dc:creator>shrikavi2008</dc:creator>
      <dc:date>2016-01-19T16:07:55Z</dc:date>
    </item>
    <item>
      <title>Re: million records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/million-records/m-p/2262343#M42838</link>
      <description>Hi,&lt;BR /&gt;There is no standard answer for the maximum volume of data handled by Talend. It depends on project scale, job design, data source and so on.&lt;BR /&gt;Do you want to use SQL query or tMap?making a join? to&amp;nbsp;handle&amp;nbsp;Lookups?&lt;BR /&gt;&amp;nbsp;Please provide&amp;nbsp;the details. ELT components(all tables are in same DB), bulk execute are better way to load large data in a faster way.&lt;BR /&gt;Best regards&lt;BR /&gt;Sabrina</description>
      <pubDate>Wed, 20 Jan 2016 07:04:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/million-records/m-p/2262343#M42838</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-01-20T07:04:57Z</dc:date>
    </item>
    <item>
      <title>Re: million records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/million-records/m-p/2262344#M42839</link>
      <description>Probably a late reply, but it might useful for others. 
&lt;BR /&gt;Q1. Assuming that you are reading from RDBMS Table: Yes, but it should be in stream mode to&amp;nbsp;avoid&amp;nbsp;Java heap error.&amp;nbsp;I have loaded 33 million records extracted from 1 billion records table in MySQL through Talend. 
&lt;BR /&gt;Q2. For better performance, we need to consider the total&amp;nbsp;number of input data&amp;nbsp; and output records count. For the above scenario, I need only 33 millions of records out of 1 billion records. I used a inner&amp;nbsp;join query in extract query with stream mode, instead of lookup. Look up table's data needs to reside in the Server RAM. So,&amp;nbsp;size of lookup table/file depends on the&amp;nbsp;Server RAM, on which Talend installed. 
&lt;BR /&gt;Q3.&amp;nbsp;I advice you to store the lookup data in the DB and join them in the extract query, as the lookup data is huge. 
&lt;BR /&gt;Thanks, 
&lt;BR /&gt;Srini, 
&lt;BR /&gt;AgilitX</description>
      <pubDate>Wed, 27 Apr 2016 16:51:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/million-records/m-p/2262344#M42839</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-04-27T16:51:07Z</dc:date>
    </item>
  </channel>
</rss>

