I have a question about data manipulation and calculations in the load script. Other discussions on the forum mention Levenshtein distance, which I'd like to use during my load. Here is what I'm trying to do. You may want to read just this description and skip my script, as I'm fairly certain it's far from what I actually want.
I have a list of invoices (the real list will be much longer; I'm just starting simple):
LOAD * INLINE [
I want to get the average Levenshtein distance from each invoice number to all the other invoice numbers. The entries with the lowest averages will be the ones most similar to the others. The problem is that the Levenshtein function I'm using takes two arguments, and I can't figure out how to write my script so that every invoice number gets compared against every other one: