Hi Guys -
Here is what I know - its based on a whole host of factors including unique matches, total matches, cardinality, data type, symbol count, unique values - so there is a pretty sophisticated algorithm working to perform the data profiling. I know at one time we either displayed the percentage matched, or had percentages calculated and documented but it was difficult for this to be replicated so we removed them from the doc.
I attached a data set for you to play with
For example if you attempt to match Customers with Products - (orange)
it associates on CustomerNumber and ProductsID - obviously they are numeric values that were the closest values in both tables to match.
ProductID has values 1 to 77
CustomerNumber as 1 to 89
Even if I add more unique IDs to the products table to match the customer table 100% - I still get orange as it indicates a possible match - but based on other factors in the data - such as the supplier ID being available (also numeric and similar)- it still may not be the proper association. Also if remove supplierID form products - I still get Orange - this is because it recognizes in the CustomerNumber that the values are only 1:1 and not 1:many. (in other word there is exactly 1 value in Customer and 1 value in products that match, so this indicates the orange)
Let me know if this is clear - if not I can do some more digging for you.
Thanks Jagan for alerting me of the thread :-)
When applicable please mark the appropriate replies as CORRECT. This will help community members and Qlik Employees know which discussions have already been addressed and have a possible known solution. Please mark threads as HELPFUL if the provided solution is helpful to the problem, but does not necessarily solve the indicated problem. You can mark multiple threads as HELPFUL if you feel additional info is useful to others.