<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Joining Large Tables in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Joining-Large-Tables/m-p/2352690#M119038</link>
    <description>Hi yogi9009,
&lt;BR /&gt;You can create the tDBInput schema yourself. for each attribute you wanna retrieve, create a correspondant column in your schema. good luck
&lt;BR /&gt;Youssef</description>
    <pubDate>Fri, 17 Apr 2009 14:56:34 GMT</pubDate>
    <dc:creator>youssef2</dc:creator>
    <dc:date>2009-04-17T14:56:34Z</dc:date>
    <item>
      <title>Joining Large Tables</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Joining-Large-Tables/m-p/2352687#M119035</link>
      <description>I have 20 tables that I need to join in a map. The output needs to go into a XML file. 
&lt;BR /&gt;I have defined the map as: 
&lt;BR /&gt;Oracle input tables --&amp;gt; tmap (joins the child tables and primary tables based on keys) --&amp;gt; tAdvanceFileOutputXML (has the xsd for generating the output XML file). The process runs fine for small data sets, but when the data volume is large (15-25 million records) then tools fails giving out of memory error. 
&lt;BR /&gt;TalenD is getting rows from each table individually and then joining the rows in the tmap transform. So, for large data sets it is failing. 
&lt;BR /&gt;Is there any way to have TalenD send a consolidated SQL query to the database so that the heavy joining is done on the database and not in TalenD. 
&lt;BR /&gt; 
&lt;BR /&gt;An alternate approach I tried was to pass the combined SQL query (for all the tables) by manually entering it in the Query section of the tDBInput component. However, when I click on the Guess Schema button, I get the following error: "Connect to DB error, or some errors in SQL query string, of Guess Schema not compatible with current SQL query string" 
&lt;BR /&gt;The query works in TOAD and is a simple one, pulling data from just two table with a simple join condition. 
&lt;BR /&gt;Is there any solution that you can point me to?</description>
      <pubDate>Sat, 16 Nov 2024 13:59:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Joining-Large-Tables/m-p/2352687#M119035</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T13:59:33Z</dc:date>
    </item>
    <item>
      <title>Re: Joining Large Tables</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Joining-Large-Tables/m-p/2352688#M119036</link>
      <description>By default Talend tries to load all the data needed to do joins/lookups in a tMap into memory. 
&lt;BR /&gt;There are some store on disk settings - where you specify a temp file name to use - and how many rows to store in memory/on disk. 
&lt;BR /&gt;However, if you're working w/lookup tables that have a large number of rows in them - and there are 20 tables. hen you are going to have performance issues. 
&lt;BR /&gt;Personally, if I'm dealing with a large number of rows, I like to do as many of my joins as possible in the SQL statement of one or more tDBInput components.</description>
      <pubDate>Thu, 16 Apr 2009 21:49:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Joining-Large-Tables/m-p/2352688#M119036</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-04-16T21:49:26Z</dc:date>
    </item>
    <item>
      <title>Re: Joining Large Tables</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Joining-Large-Tables/m-p/2352689#M119037</link>
      <description>In fact I tried that approach : passing the combined SQL query (for all the tables) by manually entering it in the Query section of the tDBInput component. However, when I click on the Guess Schema button, I get the following error: "Connect to DB error, or some errors in SQL query string, of Guess Schema not compatible with current SQL query string" 
&lt;BR /&gt;The query works in TOAD and is a simple one, pulling data from just two table with a simple join condition. 
&lt;BR /&gt;Are there any any other settings need to be done? 
&lt;BR /&gt;Thanks.</description>
      <pubDate>Fri, 17 Apr 2009 14:12:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Joining-Large-Tables/m-p/2352689#M119037</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2009-04-17T14:12:43Z</dc:date>
    </item>
    <item>
      <title>Re: Joining Large Tables</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Joining-Large-Tables/m-p/2352690#M119038</link>
      <description>Hi yogi9009,
&lt;BR /&gt;You can create the tDBInput schema yourself. for each attribute you wanna retrieve, create a correspondant column in your schema. good luck
&lt;BR /&gt;Youssef</description>
      <pubDate>Fri, 17 Apr 2009 14:56:34 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Joining-Large-Tables/m-p/2352690#M119038</guid>
      <dc:creator>youssef2</dc:creator>
      <dc:date>2009-04-17T14:56:34Z</dc:date>
    </item>
  </channel>
</rss>

