Solved: How can I use sum() instead of count(Distinct ) - Page 2 - Qlik Community

Report Inappropriate Content · ‎2015-09-24

Hi all,

It is said that using Sum() instead of count(Distinct ) the performance will increase. But I have duplicate Key values in my data.

So how can I use it. Can anyone help me on this?

Regards,

Anjali Gupta

Anonymous · ‎2015-09-25

Despite the Myth of Count Distinct, record counters are still very useful and I use them extensively. Well worth checking out the suggestions / links that have been suggested.

simenkg · ‎2015-09-25

As others have said; today you can use Count(Distinct) without a performance issue.

If you really want to use SUM instead of count(Distinct) you can do the following:

Fact:

load %Keytobecounted, Facts, numbers, etc from fact.qvd (qvd);

CountingTable:

load distinct %Keytobecounted, 1 as Keycounter resident Fact;

This creates a table with the unique keys and a field you can sum that will equal a count of keys.

now you can use the expression Sum(Keycounter) instead of Count(Distinct %Keytobecounted).

qlikviewwizard · ‎2015-09-25

Anjali wrote" It is said that using Sum() instead of count(Distinct ) the performance will increase.".

It is not correct. We can use either one. Count(Distinct) won't reduce the performance.

Read tresco's link.

Anonymous · ‎2015-09-25

Like many things this all depends......

Count Distinct used to be single threaded and hence could be slow. Some releases ago it was made multi-threaded which made it faster dependent on how many processor cores you have.

But it will take more processor cycles to execute, albeit striped across available cores.

Processor cycles = Electricity used = Heat produced = Global warming

I know I am being pedantic here, generating electricity produces CO2 and every little helps.