2 Replies Latest reply: Feb 4, 2016 7:00 AM by Marcus Sommer RSS

    count "almost" distinct values

    Evgeny Stuchalkin

      Hello! I'm have a client base, with phone numbers and names. I want to filter bots, thats placing orders sometimes. One of the main bot's singn: one phone number with different names.

       

      So, i can count distinct names for every phone number.

       

      98787c9551.jpg

      But! Sometime real customers entering ther names differently. Like this:

      d950e5ab2f.jpg

      Perfect example. This is three versions of one name: Short Name, Extended name with error, and Real extended name.

      Also, some body use several space bars between words, or even strange symbols like "+" instead of space bar.

       

      So, my question is: How can i count this names as one?