Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 
Not applicable

Matching strings with spelling differences in script

I am loading sales ndata. The customers names in these data are not always consistently spelled. Is there a way to automatically match nearly identically spelled customer names? Please note that I have thousands of names, so I cannot manually make a matching table.

6 Replies
robert_mika
Master III
Master III

eespiritu
Creator
Creator

Hi Marc,

Does customer names data has an ID or something similar to identify each one of these customers (email)?

Gysbert_Wassenaar

See this blog post: Teaching the load script new tricks

And my comment in the comments section.


talk is cheap, supply exceeds demand
Not applicable
Author

Not so. Otherwise, the solution would be trivial, isn't it,

Not applicable
Author

This is interesting. Not sure it will work for my issue. Some of the cuatomer names are not only spelled differntly, but also have some extra terms. E.g. Svenson and Svenson AB. Do you think these phonetic algorithms would work in this situation?

Anonymous
Not applicable
Author

Marc,

If there is no ID - there is no reliable solution.  The solution to which gwassenaar is pointing is a solution for a different problem, that is how to merge similar sounding names into one.  It may or may not be what you need.

The problem is that there is no way to tell that "Svenson" and "Svenson AB" are two different persons.  To make the problem worse - you cannot tell for sure even that "Marc Donckers" and "Marc Donckers", spelled exactly the same way, is the same person.  Can you?  Are "Cassius Clay" and "Muhammad Ali" two different people?  Not really.  OK, the last example is rather uncommon, I admit...

So, you either have to create an ID combining several fields if possible (date of birth, eye color, etc.), or go manually.

Regards,

Michael