Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Join us in Bucharest on Sept 18th for Qlik's AI Reality Tour! Register Now
cancel
Showing results for 
Search instead for 
Did you mean: 
Anonymous
Not applicable

Extract different record values from 2 csv files

Hi there, I m trying to compare the records of 2 identical csv files that contain about 80 different fields. I want to list for each field the records that have different values and write the output to a single file. I've tried comparing the two files with a tMap (by joining the primary key of the two files), and creating a single output for each field, which is not very efficient for more than 10 fields. Is there any alternative approach that I can apply ? 


Thanks,
Panos

Labels (2)
3 Replies
Anonymous
Not applicable
Author

Hi,
Could you please take a look at Talend DQ profile feature which is used to analyze column, file and table structure
TalendHelpCenter:Comparing identical columns in different tables to see if it is satisfying your needs?
Best regards
Sabrina
Anonymous
Not applicable
Author

Hi, thanks for replying ! I don't think DQ will help me, as I have more than 5 pairs of identical csv files that I want to check, which are about 400mb each. I would prefer DI in order to automate the process for all the files I want to validate.
Regards,
Panos
Anonymous
Not applicable
Author

I managed to find a solution to my problem, so I' m posting it in case someone needs to do something similar with DI. I used a tMap component to join on the equal ID of the two tables, a custom tUnpivotRow component to transform the csv file to a key-value table and filters to filter out equal records. Finally I got the format I wanted by extracting the values from the unpivoted table.

0683p000009MFYT.png
0683p000009MFTZ.png
0683p000009MFYY.png