Hello,
To de-duplicate my data and get the golden records, I used the tMatchGroup component.
My problem is that the input data may contain Arabic words, which the existing matching functions (Soundex, Jaro, Levenshtein, ...) do not process.
Does anyone have an idea about this issue? Or can we create our own custom matching function, and if so, how?
Thanks in advance.
Hello,
This looks like a new feature request. Could you please create a new feature issue in the Talend Jira bug tracker?
https://jira.talendforge.org/secure/Dashboard.jspa
Best regards
Sabrina
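
For anyone who needs something in the meantime, here is a minimal sketch of what a custom similarity function for Arabic text could look like in plain Java. It is not Talend's API: the class name ArabicSimilarity and its methods are hypothetical, and how (or whether) it can be plugged into tMatchGroup depends on your Talend version and is not shown here. The assumption is that stripping Arabic diacritics and the tatweel character before comparing, and computing the edit distance on Unicode code points, is good enough for your data.

```java
import java.text.Normalizer;

// Hypothetical helper, not part of Talend: Levenshtein-based similarity
// that normalizes Arabic text before comparing.
public class ArabicSimilarity {

    // Compose characters (NFC), then drop Arabic diacritics
    // (U+064B..U+065F, U+0670) and the tatweel/kashida (U+0640).
    static String normalize(String s) {
        String nfc = Normalizer.normalize(s, Normalizer.Form.NFC);
        return nfc.replaceAll("[\\u064B-\\u065F\\u0670\\u0640]", "").trim();
    }

    // Classic two-row Levenshtein distance over Unicode code points.
    static int levenshtein(String a, String b) {
        int[] x = a.codePoints().toArray();
        int[] y = b.codePoints().toArray();
        int[] prev = new int[y.length + 1];
        int[] curr = new int[y.length + 1];
        for (int j = 0; j <= y.length; j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= x.length; i++) {
            curr[0] = i;
            for (int j = 1; j <= y.length; j++) {
                int cost = (x[i - 1] == y[j - 1]) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[y.length];
    }

    // Similarity in [0, 1]: 1.0 means identical after normalization.
    static double similarity(String a, String b) {
        String na = normalize(a);
        String nb = normalize(b);
        int maxLen = Math.max(na.codePointCount(0, na.length()),
                              nb.codePointCount(0, nb.length()));
        if (maxLen == 0) {
            return 1.0;
        }
        return 1.0 - (double) levenshtein(na, nb) / maxLen;
    }
}
```

You would call ArabicSimilarity.similarity(value1, value2) and compare the result against a threshold of your choosing. Phonetic matching (a Soundex equivalent for Arabic) is a separate problem that this sketch does not address.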