Pitfalls of the Aggr function - Page 2 - Qlik Community

hic · ‎2015-10-06

The Aggr() function is one of the most advanced functions in the Qlik engine, and it is not always easy to use. This blog post is about its most common pitfalls.

sohailansari201 · ‎2015-10-13

Very informative post, HIC. It would be very helpful if you could also talk about nested Aggr() functions and their implications, for example something like:

Count(distinct{$<Active={1}, isBT={0}>} aggr(nodistinct If(
(date(
max(Aggr(nodistinct max({$<Active={1}>} if(AddMonths(CalMon...

Thanks.

christian77 · ‎2015-10-19

Thanks HIC.

Report Inappropriate Content · ‎2015-11-04

Thanks Henric, this couldn't have come at a better time for me. I'm trying to build a measure to show the expected number of diseases in a local population with known age distribution. To do this, I multiply the population at each age group by the rates of disease at national level, i.e.

Sum( Aggr( Sum( Population ) * Sum( DiseaseCases_National ) / Sum( Population_National ), AgeGroup ) )
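To make the age weighting concrete, here is a minimal Python sketch of the arithmetic behind the expression above. It is not Qlik syntax, and the age bands and all figures are invented for illustration; only the formula (local population times national cases divided by national population, summed over age groups) comes from the post.

```python
# Hypothetical figures for illustration only; none of them come from the thread.
local_population = {"0-39": 1000, "40-64": 500, "65+": 500}
national_population = {"0-39": 500_000, "40-64": 300_000, "65+": 200_000}
national_cases = {"0-39": 500, "40-64": 1_500, "65+": 4_000}


def expected_cases(local_pop, nat_cases, nat_pop):
    """Sum over age groups of local population times the national rate."""
    return sum(
        local_pop[age] * nat_cases[age] / nat_pop[age]
        for age in local_pop
    )


def crude_expected_cases(local_pop, nat_cases, nat_pop):
    """The same formula evaluated once on the totals, ignoring age structure."""
    return (
        sum(local_pop.values()) * sum(nat_cases.values()) / sum(nat_pop.values())
    )


age_weighted = expected_cases(local_population, national_cases, national_population)
crude = crude_expected_cases(local_population, national_cases, national_population)
print(age_weighted, crude)
```

With these made-up numbers the age-weighted figure is 13.5 and the crude figure is 12.0, because the hypothetical local population has a larger share of the oldest band than the national one. That difference is why the calculation has to happen per age group before the results are summed.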

To make this work on a chart with drilldowns for both location and disease type dimensions, I've had to add both the Location grain and the DiseaseType grain to the AgeGroup, which slowed things down significantly.

Any workaround or optimization tips you can provide would be much appreciated.

Dannie

hic · ‎2015-11-04

Do you really need Aggr() for this? I would just use

Sum( Population ) * Sum( DiseaseCases_National ) / Sum( Population_National )

and then add the appropriate dimensions in the chart, e.g:

Location, DiseaseType, AgeGroup

If this suggestion isn't the answer, I suggest you open a separate thread for your question.

HIC

Report Inappropriate Content · ‎2015-11-04

Yes I think I'll open a thread for this. The minimum requirement is to adjust (or weight) the expected cases by age, since disease rates are higher in the elderly and some locations have higher proportions of elderly. However I'd like to provide location and disease type as sliceable (and collapsible) dimensions, so the same measure (expected cases) can be used on multiple charts with different location/disease type selections.

D

mgranillo · ‎2016-06-23

Henric,

Can you explain the processing steps of this aggr example:

I have the result I want, I just don't fully understand how Qlik is working.

It's my goal to count the new distinct names per month. So month 1 should return a count of 5 (all names are new), month 2 should return a count of 2 (because F and G are the newly added names), and month 3 should be 1.

Sample Data:

LOAD * INLINE [
Month, Name
1,A
1,B
1,C
1,D
1,E
2,F
2,G
2,A
2,B
2,C
2,D
3,H
3,A
3,B
3,C
];

I've applied the following dimension and expression to a table:

Dimension: Month

Expression:

sum(
  aggr(
    count(distinct "Name")
  ,"Name")
)

Result:

It's my understanding from your explanation, that the aggr function will return a table like this:

Name	count(distinct "Name")
A	1
B	1
C	1
D	1
E	1
F	1
G	1
H	1

But how does the Month dimension get applied to this table to return a distinct count in the final visual?

hic · ‎2016-06-24

This is an example of a Grain Mismatch. You display the result of the Aggr() function in a chart that has a dimension that is missing inside the Aggr().

I would be very careful here - Grain Mismatches are unpredictable. Basically, what happens is that the first Month found inside the Aggr bin is used for the entire bin. So it is highly dependent on the load order. I would try to create an expression that does not depend on the initial load order.
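As a point of comparison, the intended result (new names per month) can be defined without any reference to record order. The following Python sketch is only a model of that definition, not Qlik syntax: it takes the earliest Month per Name and counts names by that month. The function name is made up for the example; the rows are the sample data from the question.

```python
from collections import Counter

# (Month, Name) rows from the sample data in the question.
rows = [
    (1, "A"), (1, "B"), (1, "C"), (1, "D"), (1, "E"),
    (2, "F"), (2, "G"), (2, "A"), (2, "B"), (2, "C"), (2, "D"),
    (3, "H"), (3, "A"), (3, "B"), (3, "C"),
]


def new_names_per_month(rows):
    """Count names by the earliest month in which each name appears."""
    first_month = {}
    for month, name in rows:
        # Keep the minimum month per name, so the row order does not matter.
        if name not in first_month or month < first_month[name]:
            first_month[name] = month
    return dict(Counter(first_month.values()))


result = new_names_per_month(rows)
reversed_result = new_names_per_month(list(reversed(rows)))
print(result)           # {1: 5, 2: 2, 3: 1}
print(reversed_result)  # same counts, regardless of record order
```

Because the minimum month is computed explicitly, reversing the rows gives the same counts. An expression built on that idea would not rely on which record happens to be loaded first.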

HIC

hic · ‎2016-06-24

Or to illuminate this further: If you create an app with just one table:

Load * Inline
[Month,Product,OrderID
1,A,4
2,A,1
2,B,2
3,A,3
3,B,5
1,B,6
];

and then analyse this information using a similar expression

Sum(Aggr(Count(OrderID),Product))

you will find that the result depends on the order of the source records: Since 'A' is first found in Month 1, all 'A' orders are attributed to Month 1. Similarly all 'B' orders are attributed to Month 2. Change the order of the records, and you'll see.
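The load-order dependence can be imitated in a few lines of Python. This is a simplified model of the behaviour described above (each Product bin is attributed to the Month of its first record), not a description of the engine's internals, and the function name is made up for the example.

```python
def sum_aggr_count_by_product(rows):
    """Model Sum(Aggr(Count(OrderID), Product)) shown per Month.

    Assumption: each Product bin is attributed to the Month of the first
    record found for that Product, as described in the post.
    """
    first_month = {}
    order_count = {}
    for month, product, _order_id in rows:
        first_month.setdefault(product, month)  # the first Month seen wins
        order_count[product] = order_count.get(product, 0) + 1

    per_month = {}
    for product, count in order_count.items():
        month = first_month[product]
        per_month[month] = per_month.get(month, 0) + count
    return per_month


# (Month, Product, OrderID) records in the order given in the inline table.
rows = [
    (1, "A", 4), (2, "A", 1), (2, "B", 2),
    (3, "A", 3), (3, "B", 5), (1, "B", 6),
]

original = sum_aggr_count_by_product(rows)
# Move the last record (1, B, 6) to the front; the data are otherwise unchanged.
reordered = sum_aggr_count_by_product([rows[-1]] + rows[:-1])
print(original)   # {1: 3, 2: 3}
print(reordered)  # {1: 6}
```

In this model the same six records give three orders each for Month 1 and Month 2 in the original order, but six orders for Month 1 once the record for 'B' in Month 1 is moved to the top.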

HIC

mgranillo · ‎2016-06-24

Thank you Henric for the explanation. This is very helpful.

Mike

daniel_wennstro · ‎2020-04-27

Hi @hic ,

I have a case where I run into the "4 - Grain Mismatch" pitfall, but I partly want that behaviour:

The customer has a lot of agreements, and some agreements apply to more than one product group. Now they want to analyze how many of their agreements apply to multiple product groups, and also see how much of the agreement amount applies to each product group for those agreements.

I tried with an expression like:

sum(aggr(if(count(Distinct ProductGroup)>1,1),Agreement))

That works fine at the top level.

It also works fine when I expand to see which agreements these are.

But when I expand to see the product groups that are associated with each agreement, I get the grain mismatch problem you describe.

However, I would like to see that level, and I would also like to see the agreement amount for each product group included in the agreements that have more than one product group.

I have also thought about doing this in the script, but then too much flexibility will be lost for the end users.

Do you have any idea how to solve this?
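To pin down the desired result independently of Qlik syntax, the calculation can be sketched in Python: first flag agreements at the Agreement grain, then report amounts at the (Agreement, ProductGroup) grain for the flagged agreements only. The sample rows, the Amount column, and the function names are all hypothetical; this models the intended output and is not a Qlik solution.

```python
# Hypothetical rows: (Agreement, ProductGroup, Amount).
rows = [
    ("AG1", "Hardware", 100.0),
    ("AG1", "Software", 50.0),
    ("AG2", "Hardware", 70.0),
    ("AG3", "Software", 20.0),
    ("AG3", "Services", 30.0),
    ("AG3", "Software", 10.0),
]


def multi_group_agreements(rows):
    """Return the agreements that cover more than one distinct product group."""
    groups = {}
    for agreement, group, _amount in rows:
        groups.setdefault(agreement, set()).add(group)
    return {agreement for agreement, g in groups.items() if len(g) > 1}


def amount_by_agreement_and_group(rows):
    """Sum amounts per (agreement, product group) for flagged agreements only."""
    flagged = multi_group_agreements(rows)
    totals = {}
    for agreement, group, amount in rows:
        if agreement in flagged:
            key = (agreement, group)
            totals[key] = totals.get(key, 0.0) + amount
    return totals


flagged = multi_group_agreements(rows)
detail = amount_by_agreement_and_group(rows)
print(len(flagged))  # number of agreements with more than one product group
print(detail)
```

The point of the sketch is that the flag is evaluated once per agreement and then carried down to every product group row of that agreement, so the detail level never needs to re-evaluate the distinct count at its own grain.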