Qlik Community

QlikView App Development

Discussion Board for collaboration related to QlikView App Development.

martynlloyd
Contributor III

Use HASH to match misspelt names?

Hi,

I have two sets of customer name data, one is reliable, the other is not, for example

Correct name: ACE Construction and Demolition Limited

Variations in user input

ACE Construction

ACE Demolition

A.C.E. Contrction LTD

I want to be able to create a 'best-fit' matching application - I'm thinking of resequencing the strings, as in

aacccdddeeeiiiiillmmnnnnoooorsttttu

ACE Construction would then have a match coefficient of 15 out of 35; removing Limited and LTD etc would give a match of 15/28

or 54%.

Any ideas?

Best regards,

Marty.

Tags (3)
2 Replies
Highlighted
shane_spencer
Valued Contributor

Re: Use HASH to match misspelt names?

This Document sprang to mind: http://community.qlik.com/docs/DOC-7051 it's not exactly the same but it seems to do a similar thing.

MVP
MVP

Re: Use HASH to match misspelt names?

Hi Martyn,

you can try Levenshtein distance algorithm:

http://community.qlik.com/message/517405#517405

- Ralf

Community Browser