Recipe for a Pareto Analysis – Revisited - Page 8 - Qlik Community

hic · ‎2016-12-13

“Which products contribute to the first 80% of our turnover?”

SDT · ‎2025-02-14

Great post with excellent longevity! Thank you all.

I am using the sorted aggregation to get the # of items making up 80% of revenue.

It works fantastic if I select a single fiscal year. What I would like is to create a visual over time showing how many items made up 80% of revenue each fiscal year. I built a bar chart with FiscalYear as the dimension and added FiscalYear to the sorted aggregation. It still does not work (unless I select a single fiscal year).

=COUNT(DISTINCT
AGGR(If(RANGESUM(ABOVE(SUM({<FLXID=>} InvoicedAmt)/SUM({<FLXID=>} TOTAL InvoicedAmt),1,RowNo()))<0.8, FLXID),
FiscalYear,(FLXID,(=SUM({<FLXID=>} InvoicedAmt),desc))))

I'm guessing it has to do with the ABOVE() function? Any help would be greatly appreciated.

Cheers,

Steve

marcus_sommer · ‎2025-02-14

You may try to return multiple value for the aggr() and/or adding a total to the above():

=COUNT(DISTINCT
AGGR(NODISTINCT If(RANGESUM(ABOVE(TOTAL SUM({<FLXID=>} InvoicedAmt)/SUM({<FLXID=>} TOTAL InvoicedAmt),1,RowNo()))<0.8, FLXID),
FiscalYear,(FLXID,(=SUM({<FLXID=>} InvoicedAmt),desc))))

SDT · ‎2025-02-14

Thank you very much @marcus_sommer . I tried this and it still does not work. When I select a single fiscal year, I get a different result than when showing all fiscal years.

marcus_sommer · ‎2025-02-14

Maybe it's caused from the aggr-sorting which didn't include the FiscalYear or in other words you may need to wrap the sorting-expression also with an aggr().

SDT · ‎2025-02-14

I tried this and still not working correctly...

COUNT(DISTINCT
AGGR(
IF(RANGESUM(ABOVE(
SUM({<FLXID=>} TOTAL <FiscalYear,FLXID> InvoicedAmt)/SUM({<FLXID=>} TOTAL <FiscalYear> InvoicedAmt)
,1,RowNo()))<0.8, FLXID),
(FiscalYear,(Numeric,Ascending)),(FLXID,(=SUM({<FLXID=>} InvoicedAmt),desc))))

marcus_sommer · ‎2025-02-14

It was more meant in this direction:

=COUNT(DISTINCT
AGGR(NODISTINCT If(RANGESUM(ABOVE(TOTAL SUM({<FLXID=>} InvoicedAmt)/SUM({<FLXID=>} TOTAL InvoicedAmt),1,RowNo()))<0.8, FLXID),
FiscalYear,(FLXID,(=aggr(SUM({<FLXID=>} InvoicedAmt), FiscalYear),desc))))

Such kind of task could become quite tricky. Therefore it's also important to understand what happens by the attempts which didn't return the expected results. Maybe some goes partly in the wanted direction and they might be combined in another way ...

Personally I would change the object to a table-chart to be able to use n expressions in parallel with different versions respectively their parts. Further helpful could be to simplify the testing by removing any conditions (if-loops as well as the set analysis) which isn't necessary to get a working logic and just reducing the sub-set of data per selections (maybe just a few dozen records).

SDT · ‎2025-02-14

Good suggestion on the straight table. I built one out with FiscalYear and Item (FLXID) in separate columns and then broke down the expression into its parts. I got it working well up to the point where it only shows an item if it is in the top 80% of revenue for that fiscal year.

I duplicated the table and removed the item dimension. Now the expression below returns 100%, 200%, and 300% for each fiscal year. The table has a total of 3 lines. I'm thinking I need an aggregation here. Tried a few variants with no luck. Ideally it should read 100% for each fiscal year.

RANGESUM(ABOVE(

SUM({<FLXID=>} InvoicedAmt)/SUM({<FLXID=>} TOTAL <FiscalYear> InvoicedAmt)

,0,RowNo()))

SDT · ‎2025-02-14

Note that the table with the items and fiscal years is correctly sorted by year and then item based on descending revenue for that item in that year.

As soon as I add the RANGESUM() expression above, it changes the sorting. I have also tried the following with zero luck.

{<BridgeDateType={'Invoiced'}, OrderState={'Invoiced'}, InvoicedAmt={">0"}>}

AGGR(RANGESUM(ABOVE(

SUM({<FLXID=>} InvoicedAmt)/SUM({<FLXID=>} TOTAL <FiscalYear> InvoicedAmt)

,0,RowNo()))

,(FiscalYear,(FLXID(SUM({<FLXID=>} InvoicedAmt),desc))))

SDT · ‎2025-02-14

OK, I built a table with Fiscal Year and the expression below.

Then I added two more columns where I added the fiscal year to the top line set analysis.

Those two columns show the correct results for each fiscal year. The expression below only shows the correct numbers when a single fiscal year is selected.

{<BridgeDateType={'Invoiced'}, OrderState={'Invoiced'}, InvoicedAmt={">0"}>}
COUNT(DISTINCT
AGGR(
If(RANGESUM(ABOVE(
SUM(InvoicedAmt)/SUM(TOTAL <FiscalYear> InvoicedAmt)
,1,RowNo()))<0.8, FLXID),
FiscalYear,(FLXID,(=SUM(InvoicedAmt),desc))))

Oleg_Troyansky · ‎2025-02-15

@SDT - in a complex calculation like this, many things could potentially go wrong. While it's hard to be certain without testing the actual app, I believe the following is the most likely issue.

When you calculate Pareto counts within each Fiscal Year, you need to sort your Items (FLXID) in the context of each year. However, your formula, specifically the part that is sorting FLXID, is going to sort the Items by the summarized invoice amounts "globally", i.e. not considering Fiscal Year. The only solution to this problem that I can think of, is to create a combo field FLXID|FY and use that combo as the second dimension of AGGR(). I believe that should work.

As a side note, I believe you should use 0 as the second parameter to your ABOVE() function if you want the calculation to include the current raw (and I believe you should want that).

Cheers,

Oleg Troyansky