_AnonymousUser
Specialist III

Unable to view records from Match Analysis

Hi All, 
On Talend Open Studio for DQ 5.6, running on OS X 10.10, I am unable to access the "View Rows" option from Analysis Results, unlike in other analyses. Is this a known bug that has been fixed in TODQ 6, or am I doing something wrong?
I have followed the instructions given in the manual:
The explanation is however not detailed to the point that you're told how to extract this data.
Is there a reason for this (for instance, is the feature only available in the Enterprise edition)?
Thanks
5 Replies
Anonymous
Not applicable

Hi,
So far, Match Analysis does not support the "View Rows" feature, either in Talend Open Studio or in the Talend Enterprise Subscription Solution.
"I have followed the instructions given in the manual:
The explanation is however not detailed to the point that you're told how to extract this data."

Could you please indicate which document you are referring to?
Best regards
Sabrina
_AnonymousUser
Specialist III
Author

Hi Sabrina,
I meant the Talend help center guide (can't post URL):
How to show the match results
To collect duplicates from the input flow according to the match types you define, Levenshtein and Jaro-Winkler in this example, do the following:

If you are processing large data sets, select the Store on disk check box in the Analysis parameter view and:

In the Max buffer size field, type in the size of physical memory you want to allocate to processed data.
In the Temporary data directory path field, set the path to the directory where you want to store the temporary file.

Save the settings in the match analysis editor and press F6.
The analysis is executed. The match rule and blocking key are computed against the whole data set and the Analysis Results view is open in the editor
So, what would be the correct way to access the match data once the operation has run? Nothing from the operation remains in the specified Temporary data directory path.
Anonymous
Not applicable

EJB-alpha now registered as erwanbegoc
I think I just found the answer to my question:
In Talend-DQ, the purpose of the match analysis is really to create a rule that can then be used in Talend-DI. Talend-DQ is not meant to be used as a standalone application for that particular use case (deduplication). Am I correct?
_AnonymousUser
Specialist III
Author

Hi,
I am using the Talend Data Management 6.1.1 subscription version, and in Profiling --> Match Analysis I am unable to use "View Rows". Can someone please help me with this?
Kind Regards,
Neeraja.
RoryJSomers
Contributor

I too am having trouble doing this.

I have a paid, full installation of Talend Studio 7.1.1. I am looking for duplicate entries within two 'near-matching' sets of client data.

I have created a Match Analysis.

I have created a Report of this.

What I need to be able to see are the *actual* results of the matching process to pass along to my client, so they can see how widespread the problem is and also identify which records are true duplicates and which are not.

So, I want to be able to provide my customer with an Excel document that says something along the lines of:

Client Code | Forename | Surname | email
1234 | John | Smith | johnsmith@myemail.com
9876 | Jon | Smith | johnsmith@myemail.com

I believe I will need to create a "job" to do this, but I do not know how to create one.

Thank you
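
For readers who only need to see what such an output would contain, here is a rough sketch in plain Python. It is not a Talend job and does not use any Talend API. It assumes records are blocked on an identical email address and then compared with a normalised Levenshtein score on forename and surname. The function names, the 0.7 threshold, and the third sample record are all hypothetical choices made for illustration; the first two records are the ones from the post above.

```python
import csv
import io
from itertools import combinations

# First two records come from the post above; the third is a hypothetical
# non-duplicate added only to show that unrelated rows are left out.
RECORDS = [
    {"client_code": "1234", "forename": "John", "surname": "Smith",
     "email": "johnsmith@myemail.com"},
    {"client_code": "9876", "forename": "Jon", "surname": "Smith",
     "email": "johnsmith@myemail.com"},
    {"client_code": "5555", "forename": "Mary", "surname": "Jones",
     "email": "maryjones@example.com"},
]


def levenshtein(a: str, b: str) -> int:
    """Edit distance via the standard two-row dynamic programme."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Case-insensitive score in [0, 1]; 1.0 means identical strings."""
    a, b = a.lower(), b.lower()
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def find_duplicate_pairs(records, threshold=0.7):
    """Block on email, then keep pairs whose weakest name score passes."""
    blocks = {}
    for rec in records:
        blocks.setdefault(rec["email"].strip().lower(), []).append(rec)
    pairs = []
    for block in blocks.values():
        for left, right in combinations(block, 2):
            score = min(similarity(left["forename"], right["forename"]),
                        similarity(left["surname"], right["surname"]))
            if score >= threshold:
                pairs.append((left, right, round(score, 2)))
    return pairs


def pairs_to_csv(pairs) -> str:
    """Render suspected duplicates as CSV text that Excel can open."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["Group", "Client Code", "Forename", "Surname",
                     "email", "Score"])
    for group, (left, right, score) in enumerate(pairs, start=1):
        for rec in (left, right):
            writer.writerow([group, rec["client_code"], rec["forename"],
                             rec["surname"], rec["email"], score])
    return buffer.getvalue()


if __name__ == "__main__":
    print(pairs_to_csv(find_duplicate_pairs(RECORDS)))
```

Each suspected duplicate pair shares a Group number, so the client can review the groups one by one and decide which are true duplicates. Blocking on email is only one possible key; a different blocking key or threshold will change which pairs are reported.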