<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Extract Difference Data from two files in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344806#M112418</link>
    <description>No, the tables are in different databases.&lt;BR /&gt;Will try to implement the solution suggested by you.&lt;BR /&gt;Thanks</description>
    <pubDate>Tue, 19 Sep 2017 14:15:40 GMT</pubDate>
    <dc:creator>vidya821</dc:creator>
    <dc:date>2017-09-19T14:15:40Z</dc:date>
    <item>
      <title>Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344795#M112407</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Is it possible to extract the difference data from tFileCompare?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;One approach is a lookup, but I have several sets of files to compare and each set has different headers, so with a lookup I would need to modify the file schema every time. Is there any component that compares two files and outputs the difference data directly?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Fri, 15 Sep 2017 09:19:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344795#M112407</guid>
      <dc:creator>vidya821</dc:creator>
      <dc:date>2017-09-15T09:19:40Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344796#M112408</link>
      <description>&lt;P&gt;Are you looking for a character-by-character comparison? For example, given the two samples below, the differences would be highlighted as shown:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;aaaaaaaaaaaaaaaabbbbbbbbbbbbbbcccccccccccccccdddddddddddd
eeeeeeeeeeeeeeffffffffffffffffffffggggggggggggggggghhhhhhhhhhhhiiiiiiii&lt;/PRE&gt;
&lt;PRE&gt;&lt;FONT color="#FF0000"&gt;1&lt;/FONT&gt;&lt;FONT color="#00FF00"&gt;aaaaaaaaaaaaaaaabbb&lt;/FONT&gt;&lt;FONT color="#FF0000"&gt;2&lt;/FONT&gt;&lt;FONT color="#00FF00"&gt;bbbbbbbbbbbcccccccccccccccdddddddddddd
eeeeeeeeeeeeeeffffffffffffffffffffggggggggg&lt;/FONT&gt;&lt;FONT color="#FF0000"&gt;3&lt;/FONT&gt;&lt;FONT color="#00FF00"&gt;gggggggghhhhhhhhhhhhiiiiiiii&lt;/FONT&gt;&lt;/PRE&gt;
&lt;P&gt;If that is what you want, there is nothing "out of the box" and it might be quite tricky to build this using standard components. You could try the Talend Exchange or look for a Java API to handle this and call it from Talend.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Sep 2017 11:24:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344796#M112408</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-09-15T11:24:45Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344797#M112409</link>
      <description>No, I want to do a row-by-row comparison that checks the column values.
&lt;BR /&gt;E.g.
&lt;BR /&gt;File 1
&lt;BR /&gt;A|B|C
&lt;BR /&gt;1|1|1
&lt;BR /&gt;
&lt;BR /&gt;File 2
&lt;BR /&gt;A|B|C
&lt;BR /&gt;1|1|1
&lt;BR /&gt;2|2|2
&lt;BR /&gt;
&lt;BR /&gt;So if I compare these files, the output file should be
&lt;BR /&gt;A|B|C
&lt;BR /&gt;2|2|2
&lt;BR /&gt;
&lt;BR /&gt;</description>
      <pubDate>Fri, 15 Sep 2017 12:47:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344797#M112409</guid>
      <dc:creator>vidya821</dc:creator>
      <dc:date>2017-09-15T12:47:26Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344798#M112410</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;Where do your input files come from? Tables? Are you looking for redundancy analysis in the Talend Data Quality product?&lt;/P&gt;
&lt;P&gt;Best regards&lt;/P&gt;
&lt;P&gt;Sabrina&lt;/P&gt;</description>
      <pubDate>Mon, 18 Sep 2017 03:53:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344798#M112410</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-09-18T03:53:38Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344799#M112411</link>
      <description>Yes, the inputs are from tables with the same schema but from two different databases (Oracle 9 and Oracle 12 respectively).
&lt;BR /&gt;However, I cannot connect to both databases in Talend due to some concerns, but I can extract data files from each individually.
&lt;BR /&gt;I need these data files to be compared to validate that all the data for a few selected tables has migrated correctly from Oracle 9 to Oracle 12.</description>
      <pubDate>Mon, 18 Sep 2017 08:54:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344799#M112411</guid>
      <dc:creator>vidya821</dc:creator>
      <dc:date>2017-09-18T08:54:14Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344800#M112412</link>
      <description>&lt;A href="https://community.qlik.com/s/profile/00539000004XsaeAAC"&gt;@xdshi&lt;/A&gt;, 
&lt;BR /&gt;Can you please tell me if the design below is possible in Talend? 
&lt;BR /&gt;1) I have two database connections available in the application (METADATA). 
&lt;BR /&gt;2) Both databases have the same tables with similar schemas (there are 30 or so tables). 
&lt;BR /&gt;3) I want to compare the data from the two corresponding tables. 
&lt;BR /&gt;4) Can I design a job where I pass in the table names one after another, and the job does the data comparison and creates an output file with the differing rows? 
&lt;BR /&gt;5) This way I can avoid creating a new job for each table comparison (for 30 tables I would have to create 30 jobs).</description>
      <pubDate>Tue, 19 Sep 2017 11:15:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344800#M112412</guid>
      <dc:creator>vidya821</dc:creator>
      <dc:date>2017-09-19T11:15:01Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344801#M112413</link>
      <description>&lt;P&gt;If the tables that are supplying the data have the same schema, you don't need to worry about the headers at all. Just join your two files using a tMap and ensure that every column that should be the same is joined. Then have two outputs: one for the matches and one for rows from the main flow that do not match.&lt;/P&gt;</description>
      <pubDate>Tue, 19 Sep 2017 11:26:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344801#M112413</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-09-19T11:26:59Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344802#M112414</link>
      <description>Thanks Rhall, 
&lt;BR /&gt;But now I have two tables directly rather than file data. I know that we can also join two Oracle tables and perform the operation, but my job has to handle 30 sets of tables. 
&lt;BR /&gt;Is it possible to design one job that can be reused for every table set? 
&lt;BR /&gt;The DBs are laid out like this: 
&lt;BR /&gt;DB1 DB2 
&lt;BR /&gt;Table1-Schema1 Table1-Schema1 
&lt;BR /&gt;Table2-Schema2 Table2-Schema2 
&lt;BR /&gt;Table3-Schema3 Table3-Schema3 
&lt;BR /&gt;So I have to compare Table1 from DB1 and DB2 and create an output file with the rows that are not common to both. 
&lt;BR /&gt;After Table1 the process should continue with Table2, and so on.</description>
      <pubDate>Tue, 19 Sep 2017 12:50:58 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344802#M112414</guid>
      <dc:creator>vidya821</dc:creator>
      <dc:date>2017-09-19T12:50:58Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344803#M112415</link>
      <description>&lt;P&gt;Yes, you can create one job for all tables, but it will only tell you about rows that are exactly the same, and it will be complicated to build if you are new to this.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;1) Input your data from your tables with ALL of the columns concatenated and hashed. Output this as a String (Varchar). You will need a primary key on the table to be output as well. So your data from each table will be ....&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Key&lt;BR /&gt;ConcatenatedHash&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;2) In your tMap join on your ConcatenatedHash column. Remember that the Main flow will be the only flow where ALL rows are guaranteed to be tested. If you require both sides to be tested you will have to reverse the lookup in another tMap.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;3) When you identify matches, you can link back to your unconcatenated data using the Key.&lt;/P&gt; 
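&lt;P&gt;The three steps above can be sketched outside Talend as follows. This is only a minimal illustration in Python, not a Talend job: the sample rows and the names key_and_hash and unmatched_keys are hypothetical, SHA-256 is an arbitrary choice of hash, and the first column is assumed to be the primary key.&lt;/P&gt;
&lt;PRE&gt;

```python
import hashlib

# Hypothetical sample rows: (key, col_a, col_b). In the thread these would come
# from the same table in two different Oracle databases.
DB1_ROWS = [(1, "a", "x"), (2, "b", "y"), (3, "c", "z")]
DB2_ROWS = [(1, "a", "x"), (2, "b", "CHANGED")]


def key_and_hash(row, sep="|"):
    """Step 1: return (key, hash of all columns joined with a separator)."""
    joined = sep.join("" if value is None else str(value) for value in row)
    return row[0], hashlib.sha256(joined.encode("utf-8")).hexdigest()


def unmatched_keys(main_rows, lookup_rows):
    """Step 2: keys of main rows whose hash has no match in the lookup rows."""
    lookup_hashes = {key_and_hash(row)[1] for row in lookup_rows}
    return [key for key, digest in map(key_and_hash, main_rows)
            if digest not in lookup_hashes]


# Only the main flow is guaranteed to be fully tested, so run both directions.
only_in_db1 = unmatched_keys(DB1_ROWS, DB2_ROWS)
only_in_db2 = unmatched_keys(DB2_ROWS, DB1_ROWS)

# Step 3: the returned keys link back to the original, unconcatenated rows.
print(only_in_db1, only_in_db2)
```

&lt;/PRE&gt;
&lt;P&gt;Running the comparison in both directions mirrors reversing the lookup in a second tMap.&lt;/P&gt;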
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 19 Sep 2017 13:21:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344803#M112415</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-09-19T13:21:53Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344804#M112416</link>
      <description>Yes, that's a fairly complex process. As per the requirement, I was hoping to implement the Oracle statement below in Talend:&lt;BR /&gt;&lt;BR /&gt;select * from DB1-table1&lt;BR /&gt;MINUS&lt;BR /&gt;select * from DB2-table1&lt;BR /&gt;&lt;BR /&gt;It's okay if this is not possible via Talend.&lt;BR /&gt;Thanks</description>
      <pubDate>Tue, 19 Sep 2017 13:40:27 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344804#M112416</guid>
      <dc:creator>vidya821</dc:creator>
      <dc:date>2017-09-19T13:40:27Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344805#M112417</link>
      <description>&lt;P&gt;That is not possible. When you do that in Oracle, it is fully aware of the schemas (although they have to be the same, which you were saying is not necessarily the case here). You can do that with Talend in your database component if both tables are in the same database, but this is not the case, or is it? If it is, just replace the table names with context variables. If it is not, the solution I gave you is not complicated at all. I believe you can make it completely dynamic if your database user has enough permissions.&lt;/P&gt;</description>
      <pubDate>Tue, 19 Sep 2017 14:10:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344805#M112417</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-09-19T14:10:12Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344806#M112418</link>
      <description>No, the tables are in different databases.&lt;BR /&gt;Will try to implement the solution suggested by you.&lt;BR /&gt;Thanks</description>
      <pubDate>Tue, 19 Sep 2017 14:15:40 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344806#M112418</guid>
      <dc:creator>vidya821</dc:creator>
      <dc:date>2017-09-19T14:15:40Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344807#M112419</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/005390000069RuGAAU"&gt;@rhall&lt;/A&gt;,&lt;BR /&gt;1) Input your data from your tables with ALL of the columns concatenated and hashed. Output this as a String (Varchar). You will need a primary key on the table to be output as well. So your data from each table will be ....&lt;BR /&gt;Key&lt;BR /&gt;ConcatenatedHash&lt;BR /&gt;-- Is there any component that does the concatenation and hashing of the columns? Or how should I do it using Talend?&lt;/P&gt;</description>
      <pubDate>Wed, 20 Sep 2017 09:17:23 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344807#M112419</guid>
      <dc:creator>vidya821</dc:creator>
      <dc:date>2017-09-20T09:17:23Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344808#M112420</link>
      <description>You can concatenate and hash in your Oracle query (which would help make it more dynamic). This way your job does not have to be aware of the schema, you just need to pass it an appropriate SQL query with the concatenation and hashing built in.</description>
      <pubDate>Wed, 20 Sep 2017 14:45:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344808#M112420</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-09-20T14:45:44Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344809#M112421</link>
      <description>Thanks Rhall, 
&lt;BR /&gt;This worked perfectly for me. 
&lt;BR /&gt;The design for the concatenation was: 
&lt;BR /&gt;a tOracleInput component to find the columns of the respective tables -&amp;gt; a tDenormalize component to combine all the columns with the separator ||'|'||, and then use the result in the select queries.</description>
      <pubDate>Thu, 21 Sep 2017 11:20:17 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344809#M112421</guid>
      <dc:creator>vidya821</dc:creator>
      <dc:date>2017-09-21T11:20:17Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344810#M112422</link>
      <description>&lt;P&gt;Why is concatenation and hashing needed?&lt;/P&gt;</description>
      <pubDate>Thu, 20 Aug 2020 13:43:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344810#M112422</guid>
      <dc:creator>shindeHarshada</dc:creator>
      <dc:date>2020-08-20T13:43:57Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Difference Data from two files</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344811#M112423</link>
      <description>&lt;P&gt;Why can't we compare two rows directly?&lt;/P&gt;</description>
      <pubDate>Fri, 21 Aug 2020 07:16:23 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Extract-Difference-Data-from-two-files/m-p/2344811#M112423</guid>
      <dc:creator>shindeHarshada</dc:creator>
      <dc:date>2020-08-21T07:16:23Z</dc:date>
    </item>
  </channel>
</rss>

