<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: tMap default innerjoin / user defined condition - execution time in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234620#M23936</link>
    <description>Hi 
&lt;BR /&gt;The condition here will surely lead to more calculations and comparisons which affects performance. 
&lt;BR /&gt;I need to know whether the execution time is acceptable. 
&lt;BR /&gt;How many rows in both of these two files? 
&lt;BR /&gt;Regards, 
&lt;BR /&gt;Pedro</description>
    <pubDate>Tue, 05 Jun 2012 09:07:15 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2012-06-05T09:07:15Z</dc:date>
    <item>
      <title>tMap default innerjoin / user defined condition - execution time</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234616#M23932</link>
      <description>Hi All,&lt;BR /&gt;I need to merge 2 tables on some condition, So i am using tMap and applying inner join. even for huge set of records, it works absolutely fine.  &lt;BR /&gt;In my new requirement, I need to merge the  tables on some user defined condition . &lt;BR /&gt;When I add some condition in tMap, there is lot of impact on execution time of job, &lt;BR /&gt;please find the attached screen shots and suggest me, &lt;BR /&gt;1. is there any flaw or is that's how tMap works.&lt;BR /&gt;2. on merge, I get only matched records in the output , is there any way to catch the rejected / unmatched records from both the tables.&lt;BR /&gt;ex: I am merging 'table A' and 'table B' based on some condition,  on execution, i am getting only the matched records in the output, I also need to catch the records, not matched in table A and table B&lt;BR /&gt;Thanks&lt;BR /&gt;Chaya</description>
      <pubDate>Tue, 05 Jun 2012 06:06:04 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234616#M23932</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-06-05T06:06:04Z</dc:date>
    </item>
    <item>
      <title>Re: tMap default innerjoin / user defined condition - execution time</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234617#M23933</link>
      <description>Hi Chaya&lt;BR /&gt;I can't find out any flaws from the image. All settings are decided by your job logic.&lt;BR /&gt;If 'table A' inner join with 'table B', you will get unmatched records of table A by selecting 'Catch lookup inner join reject' true.&lt;BR /&gt;If 'table B' inner join with 'table A', you will get unmatched records of table B as above.&lt;BR /&gt;Regards,&lt;BR /&gt;Pedro</description>
      <pubDate>Tue, 05 Jun 2012 07:51:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234617#M23933</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-06-05T07:51:00Z</dc:date>
    </item>
    <item>
      <title>Re: tMap default innerjoin / user defined condition - execution time</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234618#M23934</link>
      <description>Hi Chaya,
&lt;BR /&gt;According to the filter you are applying you are performing an inner join on the key of row4 (identification_number)
&lt;BR /&gt;I would suggest you put row3.id_number back where it was in screen one, so on the left side of row4.identification_number, instead of in the filter. this should make Talend perform a true inner join, because I think it is not performing this inner join when there is no join condition and only a filter, which might slow down your process. You might even add id_type to the join, if my iterpretation of your filter is correct, but that's up to you.
&lt;BR /&gt;As for question 2: Do as Pedro suggests: you can add an output row (create new or create join, I suppose you'll need "create join table from") and set it's options to "catch inner join rejects". This will make all unmatched row3 records go to this output. An outer join, to catch all records from all inputs is not possible unfortunately.
&lt;BR /&gt;Hope this helps.
&lt;BR /&gt;Regards,
&lt;BR /&gt;Arno</description>
      <pubDate>Tue, 05 Jun 2012 08:13:32 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234618#M23934</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-06-05T08:13:32Z</dc:date>
    </item>
    <item>
      <title>Re: tMap default innerjoin / user defined condition - execution time</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234619#M23935</link>
      <description>Thanks for the reply Pedro, 
&lt;BR /&gt;Wel, 
&lt;BR /&gt;catching rejected records is fine, i could achieve that. Thanks. 
&lt;BR /&gt;but my main concern was performance. I have a simple job, 
&lt;BR /&gt;with 2 input files and tMap (where in I have used defined expression) &amp;amp; tLog. 
&lt;BR /&gt;Why does is it takes more time, when we define conditions. If the same job , If u remove the condition &amp;amp; use normal inner join that works faster.. I am confused.. Please suggest me.</description>
      <pubDate>Tue, 05 Jun 2012 08:39:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234619#M23935</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-06-05T08:39:37Z</dc:date>
    </item>
    <item>
      <title>Re: tMap default innerjoin / user defined condition - execution time</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234620#M23936</link>
      <description>Hi 
&lt;BR /&gt;The condition here will surely lead to more calculations and comparisons which affects performance. 
&lt;BR /&gt;I need to know whether the execution time is acceptable. 
&lt;BR /&gt;How many rows in both of these two files? 
&lt;BR /&gt;Regards, 
&lt;BR /&gt;Pedro</description>
      <pubDate>Tue, 05 Jun 2012 09:07:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234620#M23936</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-06-05T09:07:15Z</dc:date>
    </item>
    <item>
      <title>Re: tMap default innerjoin / user defined condition - execution time</title>
      <link>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234621#M23937</link>
      <description>Hey Pedro, 
&lt;BR /&gt;In the process of working on the above concern, we noticed that, in the user defined condition if I have only &amp;amp;&amp;amp; (AND) operations, that works fine, in case if I have || (OR) operations, only those comparisons are taking more time. 
&lt;BR /&gt;I have around 5,00,000 records on 1 side &amp;amp; the other end I have around 3,50,000 records 
&lt;BR /&gt;The above comparison is taking around 3-4 Hrs .. &amp;amp; some times.. its just hangs in between.. 
&lt;BR /&gt; 
&lt;BR /&gt;Thanks 
&lt;BR /&gt;Chaya</description>
      <pubDate>Wed, 06 Jun 2012 05:53:39 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/tMap-default-innerjoin-user-defined-condition-execution-time/m-p/2234621#M23937</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-06-06T05:53:39Z</dc:date>
    </item>
  </channel>
</rss>

