Qlik Community

QlikView App Development

Discussion Board for collaboration related to QlikView App Development.

martynlloyd
Contributor III

Use HASH to match misspelt names?

Hi,

I have two sets of customer name data: one is reliable, the other is not. For example:

Correct name: ACE Construction and Demolition Limited

Variations in user input:

ACE Construction

ACE Demolition

A.C.E. Contrction LTD

I want to create a 'best-fit' matching application. I'm thinking of resequencing the letters of each name into alphabetical order, so that the correct name becomes:

aacccdddeeeiiiiillmmnnnnoooorsttttu

ACE Construction would then have a match coefficient of 15 out of 35; removing Limited, LTD, etc. would give a match of 15/28, or 54%.
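For anyone wanting to try the idea outside QlikView first, here is a minimal Python sketch of the letter-resequencing approach described above. It is only an illustration under assumptions: the function names and the NOISE_WORDS list are made up for this example, and the score is taken as shared letters divided by the letters in the reliable name.

```python
from collections import Counter

# Hypothetical list of suffixes to ignore; extend it to suit your data.
NOISE_WORDS = {"limited", "ltd"}


def sorted_letters(name):
    """Return the letters of a name, lower-cased and in alphabetical order."""
    return "".join(sorted(ch for ch in name.lower() if ch.isalpha()))


def letter_counts(name):
    """Count letters per name, skipping any word found in NOISE_WORDS."""
    words = name.lower().split()
    kept = [w for w in words if w.strip(".,") not in NOISE_WORDS]
    return Counter(ch for w in kept for ch in w if ch.isalpha())


def match_coefficient(candidate, reference):
    """Letters shared with the reference, as a fraction of the reference's letters."""
    cand = letter_counts(candidate)
    ref = letter_counts(reference)
    shared = sum((cand & ref).values())  # multiset intersection
    total = sum(ref.values())
    return shared / total if total else 0.0


correct = "ACE Construction and Demolition Limited"
print(sorted_letters(correct))
for variant in ("ACE Construction", "ACE Demolition", "A.C.E. Contrction LTD"):
    print(variant, round(match_coefficient(variant, correct), 2))
```

One caveat with this kind of score: it ignores letter order, so two unrelated names built from the same letters would also score highly.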

Any ideas?

Best regards,

Marty.

2 Replies
shane_spencer
Valued Contributor

Re: Use HASH to match misspelt names?

This document sprang to mind: http://community.qlik.com/docs/DOC-7051. It's not exactly the same, but it seems to do a similar thing.

MVP & Luminary

Re: Use HASH to match misspelt names?

Hi Martyn,

You could try the Levenshtein distance algorithm:

http://community.qlik.com/message/517405#517405
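To show what the algorithm computes, here is a rough Python sketch of Levenshtein distance using the standard dynamic-programming formulation. It is not the code from the linked post, and the similarity helper (distance scaled by the longer string's length) is just one assumed way of turning the distance into a 0 to 1 score.

```python
def levenshtein(a, b):
    """Minimum number of single-character inserts, deletes, or substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


def similarity(a, b):
    """Assumed scoring: 1.0 for identical strings, lower as edits increase."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1 - levenshtein(a, b) / longest


print(levenshtein("construction", "contrction"))
print(round(similarity("ace construction", "ace contrction"), 2))
```

Unlike a letter-count comparison, this is sensitive to character order, so it suits typos such as "Contrction" better than names with whole words missing.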

- Ralf