Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
I have an app with multiple fact tables which i am trying to optimize. For that purpose trying to create a link table (have not included that part)
For doing this i am doing a load distnct
load
Distinct
Employee_ID,
Parent,
FactType
resident
MasterData
where FactType='xyz';
Master table has approx 120 million records.
the result of a expression count(distinct Employee_ID&Parent) is around 400k records
But somehow the load distinct doesnt work. the above load results in 120 million records
its weird. i remember reading somewhere about this but couldnt find the article again
but basically i replace above script with
load
Distinct
Employee_ID,
Parent,
MinString(FactType)
resident
MasterData
where FactType='xyz'
group by Employee_ID , Parent;
this one worked perfectly.
Can someone explain theory behind this?
Are you doing a preceding load before this? If so, the distinct needs to be on the top load and any distinct in the following loads are ignored.
Can you try this?
Why do you think it doesn't do a distinct load? How are you checking that?
Try explicitly creating a new table so the data won't accidentally be appended to an existing table and store that table in a qvd so you can check outside of your qlikview document.
TempLink: NOCONCATENATE Load Distinct Employee_ID, Parent, FactType resident MasterData where FactType='xyz'; STORE TempLink INTO [C:\CheckMe\CheckMe.qvd] (qvd);