Hi, I would like to know how we could use set analysis to identify duplicates in a data set. For example, say you want to identify duplicate sales transactions.
Can't it be done at the script level?
Tomasz
Hi Nataraj, you can create a simple table with the dimensions whose combinations you want to check for duplicates, and an expression with a simple count on a field that you know always has data.
Rows where the count is above 1 are duplicates (there is more than one record for that combination of dimension values). Or, to flag only the duplicates:
If(Count(FieldName)>1, 'Duplicated')
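If you do want a set analysis flavour of the same check, here is a rough sketch. It assumes a field called OrderID that identifies a transaction; that name is only an illustration and is not from the original question. The search expression inside the set modifier keeps only the OrderID values that occur on more than one record, and the outer Count then tells you how many distinct transactions are duplicated.

```qlik
// OrderID is a hypothetical field name; replace it with your own transaction key.
// The quoted search "=Count(OrderID)>1" is evaluated per OrderID value.
Count({<OrderID = {"=Count(OrderID)>1"}>} DISTINCT OrderID)
```

Used as a KPI or as a table expression, this only helps when a single field identifies the transaction. If several fields together define a duplicate, the plain count per dimension combination above is usually simpler.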
Could you explain a little more? This is not exactly set analysis, but we will try our best to show how it can be done.
Try creating a composite key from the fields that make up the primary key, and count that key in an expression.
If the count is more than 1, that transaction is duplicated.
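The composite key idea can also be done at the script level. Below is a minimal sketch, not a definitive solution. The inline sample data, the field names (OrderID, Customer, Amount) and the table names are all assumptions made up for illustration; use whichever fields define a unique transaction in your data.

```qlik
// Hypothetical sample data; the field names are assumptions for illustration.
Sales:
LOAD * INLINE [
OrderID, Customer, Amount
1001, Acme, 250
1001, Acme, 250
1002, Birch, 90
];

// Build a composite key from the fields that should uniquely identify a transaction.
SalesKeyed:
LOAD *,
     OrderID & '|' & Customer & '|' & Amount AS TransactionKey
RESIDENT Sales;

DROP TABLE Sales;

// Count rows per key; a count above 1 marks a duplicated transaction.
KeyCounts:
LOAD TransactionKey,
     Count(TransactionKey) AS KeyCount,
     If(Count(TransactionKey) > 1, 'Duplicated', 'Unique') AS DuplicateFlag
RESIDENT SalesKeyed
GROUP BY TransactionKey;
```

After the reload you can select DuplicateFlag = 'Duplicated' in a list box, or use KeyCount as a dimension, to see only the duplicated transactions. The '|' separator is there so that different field values cannot accidentally concatenate into the same key.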