<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: tMap vs SQL Joins, what is faster in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347016#M114342</link>
    <description>The memory consumption in this case depends only on the size of the dimension tables, not on the size of your fact table. 
&lt;BR /&gt;If you are experiencing memory leaks here, do you see them before the job starts selecting the main flow data, or while it is reading the dimensions? 
&lt;BR /&gt;You can check this by selecting your fact data and deactivating all dimension inputs. If your job continues to lose memory, the problem is in the tMSSqlInput component. 
&lt;BR /&gt;Geography does not sound like much; customer, OK, that can be a huge table; and products should also be moderate. 
&lt;BR /&gt;Normally, joining in the tMap is the fastest way, and it comes with a lot of enhancements, such as capturing the rows that do not match the join rule (a data quality check).</description>
    <pubDate>Thu, 21 Feb 2013 20:57:09 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2013-02-21T20:57:09Z</dc:date>
    <item>
      <title>tMap vs SQL Joins, what is faster</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347012#M114338</link>
      <description>Hi,
&lt;BR /&gt;If I have to process, say, 50 million rows in my ETL, which implementation would be better or recommended:
&lt;BR /&gt;using tMap for the transformations, or writing SQL?
&lt;BR /&gt;In other words, are SQL joins and filters always faster than Talend components?</description>
      <pubDate>Tue, 19 Feb 2013 13:06:10 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347012#M114338</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-19T13:06:10Z</dc:date>
    </item>
    <item>
      <title>Re: tMap vs SQL Joins, what is faster</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347013#M114339</link>
      <description>Hi, 
&lt;BR /&gt;From your description, I think the performance depends on your workflow, your data, and the transformations. Could you give us more information about your ETL so that we can design a proper job for you?
&lt;BR /&gt;Best regards
&lt;BR /&gt;Sabrina</description>
      <pubDate>Wed, 20 Feb 2013 02:04:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347013#M114339</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-20T02:04:46Z</dc:date>
    </item>
    <item>
      <title>Re: tMap vs SQL Joins, what is faster</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347014#M114340</link>
      <description>&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;Hi, &lt;BR /&gt;From your description, I think the performance depends on your workflow, your data, and the transformations. Could you give us more information about your ETL so that we can design a proper job for you?&lt;BR /&gt;Best regards&lt;BR /&gt;Sabrina&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;Hi Sabrina, 
&lt;BR /&gt;Consider a use case where, to load my fact tables, I have to join them with 5 dimension tables (geography, customer, product, etc.) and pull their IDs into my fact table. Since the data is huge (~50 million records per day), doing this in tMap (and storing the data on disk to avoid Java heap space issues) will be slower than writing an equivalent SQL query for this use case and triggering it from tMSSqlRow. 
&lt;BR /&gt;Let me know if my understanding is correct, or whether tMap can perform faster than the SQL join query for this specific use case. 
&lt;BR /&gt;Thanks in advance.</description>
      <pubDate>Wed, 20 Feb 2013 12:31:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347014#M114340</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-20T12:31:07Z</dc:date>
    </item>
    <item>
      <title>Re: tMap vs SQL Joins, what is faster</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347015#M114341</link>
      <description>Hi, 
&lt;BR /&gt; 
&lt;BLOCKQUOTE&gt; 
 &lt;TABLE border="1"&gt; 
  &lt;TBODY&gt; 
   &lt;TR&gt; 
    &lt;TD&gt;Consider a use case where, to load my fact tables, I have to join them with 5 dimension tables (geography, customer, product, etc.) and pull their IDs into my fact table. Since the data is huge (~50 million records per day), doing this in tMap (and storing the data on disk to avoid Java heap space issues) will be slower than writing an equivalent SQL query for this use case and triggering it from tMSSqlRow.&lt;/TD&gt; 
   &lt;/TR&gt; 
  &lt;/TBODY&gt; 
 &lt;/TABLE&gt; 
&lt;/BLOCKQUOTE&gt; 
&lt;BR /&gt;You are right that tMap will be slower than writing an equivalent SQL query for this use case, because the SQL query is executed directly on the database engine. 
&lt;BR /&gt;In addition, we provide ELT components, and we recommend using them to achieve your goal. 
&lt;BR /&gt;For more details, please refer to: 
&lt;A href="https://help.talend.com/search/all?query=What+is+ELT&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt; What is ELT&lt;/A&gt; and 
&lt;A href="https://help.talend.com/search/all?query=ELT+components&amp;amp;content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;ELT components&lt;/A&gt; 
&lt;BR /&gt;By the way, you can use your forum account to log in to 
&lt;A href="https://help.talend.com?content-lang=en" target="_blank" rel="nofollow noopener noreferrer"&gt;Talend Help Center&lt;/A&gt;. 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
      <pubDate>Thu, 21 Feb 2013 03:04:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347015#M114341</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-21T03:04:56Z</dc:date>
    </item>
    <item>
      <title>Re: tMap vs SQL Joins, what is faster</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347016#M114342</link>
      <description>The memory consumption in this case depends only on the size of the dimension tables, not on the size of your fact table. 
&lt;BR /&gt;If you are experiencing memory leaks here, do you see them before the job starts selecting the main flow data, or while it is reading the dimensions? 
&lt;BR /&gt;You can check this by selecting your fact data and deactivating all dimension inputs. If your job continues to lose memory, the problem is in the tMSSqlInput component. 
&lt;BR /&gt;Geography does not sound like much; customer, OK, that can be a huge table; and products should also be moderate. 
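&lt;BR /&gt;To illustrate the memory point, here is a minimal Python sketch of a lookup-style join. It is only an illustration under stated assumptions, not Talend's actual implementation: the function names, row layouts, and codes are hypothetical. Each dimension is loaded fully into an in-memory dictionary, while the fact rows are streamed one at a time, so memory grows with the dimensions and not with the fact table. Rows with no match are routed to a reject flow.

```python
from typing import Dict, Iterable, Iterator, Optional, Tuple

# Hypothetical row layouts, chosen only for this illustration.
DimRow = Tuple[str, int]            # (business code, surrogate ID)
FactRow = Tuple[str, str, float]    # (customer code, product code, amount)


def build_lookup(rows: Iterable[DimRow]) -> Dict[str, int]:
    """Load one dimension fully into memory, keyed by its business code."""
    return {code: dim_id for code, dim_id in rows}


def join_stream(
    facts: Iterable[FactRow],
    customers: Dict[str, int],
    products: Dict[str, int],
) -> Iterator[Tuple[str, tuple]]:
    """Stream fact rows; only the two lookup dictionaries stay in memory."""
    for customer_code, product_code, amount in facts:
        customer_id: Optional[int] = customers.get(customer_code)
        product_id: Optional[int] = products.get(product_code)
        if customer_id is None or product_id is None:
            # No match on at least one dimension: route the row to a reject flow.
            yield "reject", (customer_code, product_code, amount)
        else:
            yield "match", (customer_id, product_id, amount)
```

&lt;BR /&gt;In a real job the fact rows would come from a database cursor and the matched rows would go straight to the output, so nothing in this sketch holds the whole fact table at once.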
&lt;BR /&gt;Normally, joining in the tMap is the fastest way, and it comes with a lot of enhancements, such as capturing the rows that do not match the join rule (a data quality check).</description>
      <pubDate>Thu, 21 Feb 2013 20:57:09 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347016#M114342</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-21T20:57:09Z</dc:date>
    </item>
    <item>
      <title>Re: tMap vs SQL Joins, what is faster</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347017#M114343</link>
      <description>Hi, 
&lt;BR /&gt;Thanks to both of you for your input. 
&lt;BR /&gt;So, can we conclude that joining with tMap is faster than writing an equivalent SQL query, provided that the dimension tables are small and there are no memory leaks?</description>
      <pubDate>Mon, 25 Feb 2013 14:41:06 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347017#M114343</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-25T14:41:06Z</dc:date>
    </item>
    <item>
      <title>Re: tMap vs SQL Joins, what is faster</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347018#M114344</link>
      <description>Hi,
&lt;BR /&gt;Generally speaking, you have the right idea. One more question, though: we don't have a clear limit for the size of the dimension tables. How big do the dimension tables have to be to cause a memory leak?
&lt;BR /&gt;Best regards
&lt;BR /&gt;Sabrina</description>
      <pubDate>Tue, 26 Feb 2013 02:42:25 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-vs-SQL-Joins-what-is-faster/m-p/2347018#M114344</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-02-26T02:42:25Z</dc:date>
    </item>
  </channel>
</rss>

