Skip to main content
Announcements
Qlik Connect 2025: 3 days of full immersion in data, analytics, and AI. May 13-15 | Orlando, FL: Learn More
cancel
Showing results for 
Search instead for 
Did you mean: 
cbaqir
Specialist II
Specialist II

String Percentage Comparison?

Is there a way to get QV to compare how similar two values are?

In other words, match on 75% of the string to return possible duplicates?

John Doe - ABCD

John Doe - EFGH

10 Replies
cbaqir
Specialist II
Specialist II
Author

marcowedel

I tried loading my temp table from a QVD but it still takes more than an hour to refresh. Any suggestions on how to speed it up?

tabPlanTemp:
LOAD Distinct
DESCRIPTION
FROM [..\External_Data\PP_USAGE_ECISA.xlsx] (ooxml, embedded labels, table is [CCG_PLANS])
Where Len(Trim(DESCRIPTION));

Join
LOAD DESCRIPTION as PLAN_DESC2
Resident tabPlanTemp;

tabPlan:
LOAD DESCRIPTION,
PLAN_DESC2,
Num(1-Levenshtein(DESCRIPTION,PLAN_DESC2)/RangeMax(Len(DESCRIPTION),Len(PLAN_DESC2)),'0.0%') as Similarity
Resident tabPlanTemp
Where not DESCRIPTION follows PLAN_DESC2;


//tabPlanTemp:
//LOAD DESCRIPTION,
// PLAN_DESC2
//FROM
//[..\QVD\01_Extract_QVD\EKG PowerPlan.QVD]
//(qvd);
//


DROP Table tabPlanTemp;