<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic compare data between table from 2 DBMS in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/compare-data-between-table-from-2-DBMS/m-p/2296367#M69031</link>
    <description>Can someone please advise on the best job flow? 
&lt;BR /&gt;I want to compare the results of 2 queries from different databases (one is SQL Server, the other is Oracle). Each query returns 20 million rows. 
&lt;BR /&gt; 
&lt;BR /&gt;I am thinking of something like this, as it would merge the results of both queries and remove the duplicates, but I don't know which Talend components to use to achieve this (if using a CRC row gives better performance, please incorporate that in the job flow): 
&lt;BR /&gt;select cust_id, cust_name, address from sqlserv_table 
&lt;BR /&gt;UNION 
&lt;BR /&gt;select c_id, customer_name, cust_address from oracle_table 
&lt;BR /&gt;Note: there is no primary key on the tables, so I guess all columns together, or the CRC, could be used to form the key. 
&lt;BR /&gt;Finally, I would like to output which rows are present in sqlserv_table but missing from oracle_table, and vice versa, to 2 separate files. 
&lt;BR /&gt; 
&lt;BR /&gt;Thank you!</description>
    <pubDate>Sat, 16 Nov 2024 12:25:38 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T12:25:38Z</dc:date>
    <item>
      <title>compare data between table from 2 DBMS</title>
      <link>https://community.qlik.com/t5/Talend-Studio/compare-data-between-table-from-2-DBMS/m-p/2296367#M69031</link>
      <description>Can someone please advise on the best job flow? 
&lt;BR /&gt;I want to compare the results of 2 queries from different databases (one is SQL Server, the other is Oracle). Each query returns 20 million rows. 
&lt;BR /&gt; 
&lt;BR /&gt;I am thinking of something like this, as it would merge the results of both queries and remove the duplicates, but I don't know which Talend components to use to achieve this (if using a CRC row gives better performance, please incorporate that in the job flow): 
&lt;BR /&gt;select cust_id, cust_name, address from sqlserv_table 
&lt;BR /&gt;UNION 
&lt;BR /&gt;select c_id, customer_name, cust_address from oracle_table 
&lt;BR /&gt;Note: there is no primary key on the tables, so I guess all columns together, or the CRC, could be used to form the key. 
&lt;BR /&gt;Finally, I would like to output which rows are present in sqlserv_table but missing from oracle_table, and vice versa, to 2 separate files. 
&lt;BR /&gt; 
&lt;BR /&gt;Thank you!</description>
      <pubDate>Sat, 16 Nov 2024 12:25:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/compare-data-between-table-from-2-DBMS/m-p/2296367#M69031</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T12:25:38Z</dc:date>
    </item>
    <item>
      <title>Re: compare data between table from 2 DBMS</title>
      <link>https://community.qlik.com/t5/Talend-Studio/compare-data-between-table-from-2-DBMS/m-p/2296368#M69032</link>
      <description>Hi,&lt;BR /&gt;You can use the tMap component. It lets you join 2 tables on several columns, so you can catch which rows are missing from the other table. In my example, sqlserv_table is the main input and oracle_table is the lookup. I'm catching the rows that are in sqlserv_table but NOT in oracle_table. Just swap the tables if you want the inverse (find the rows that are in Oracle and not in SQL Server). Let me know if you run into any problems.</description>
      <pubDate>Thu, 26 Jan 2012 13:35:56 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/compare-data-between-table-from-2-DBMS/m-p/2296368#M69032</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2012-01-26T13:35:56Z</dc:date>
    </item>
  </channel>
</rss>

