Algorithm for matching duplicate names

Qlik.com | Qlik Help | Resources

Ask a Question

I need to identify name duplicates containing friendly and official names in a dataset. For e.g -

1) William Stark (official name)

Bill Stark (Friendly Name)

2) Bradley Thomas (Official Name)

Brad Thomas (Friendly Name)

3) Robert Gordon (Official Name)

Bob Gordon (Friendly Name)

I was looking at Jaro, Jaro-Winkler and Soundex algorithms, but I'm wondering whether there are better methods. Appreciate any guidance/ best practises you can provide.

0 Replies

Algorithm for matching duplicate names

Talend Data Quality

v7.x