gauthamchilled
Creator

discard outliers in the scatter chart

Hi,

I would like to discard or ignore the outliers in the scatter chart. Can anyone help me? I have attached an example.

9 Replies
gauthamchilled
Creator
Author

any help?

muthukumar77
Partner - Creator III

Hi,

Do you want something like this?

Muthukumar Pandiyan
gauthamchilled
Creator
Author

I don't see any difference between my file and your image.

I want to exclude the values that are unusually large on both axes.

tresesco
MVP

Maybe something like the attached sample.

gauthamchilled
Creator
Author

Hi Tresesco,

This is only a sample, so it is easy to spot the values we don't want and filter them out.

But the actual dataset covers many countries and many years of revenue and profit, so we can't hardcode the number.

Is there any way to remove the outliers dynamically, based on the current selection, using a function like Fractile or standard deviation?

gauthamchilled
Creator
Author

bump

tresesco
MVP

You need a rule that decides which values to discard. For example, to discard the top two values, you could try an expression like:

Sum ({<[Customer Name]={"=Rank(Sum (Sales))>2"}>}Sales)

Update: Using fractile()

Sum ({<[Customer Name]={"=Sum(Sales)<Fractile(total Sales, 0.99)"}>}Sales)
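For anyone who wants to check the rank rule outside Qlik, here is a minimal Python sketch of the same idea: total the sales per customer, rank the totals, and drop the top two. The function name `drop_top_n` and the sample rows are made up for illustration, and ties are not handled the way Qlik's Rank() would handle them.

```python
from collections import defaultdict

def drop_top_n(rows, n=2):
    """Sum sales per customer, then drop the n customers with the largest totals."""
    totals = defaultdict(float)
    for customer, sales in rows:
        totals[customer] += sales
    # Customers ordered from largest total to smallest
    ranked = sorted(totals, key=totals.get, reverse=True)
    # Keep everyone ranked below the top n
    return {customer: totals[customer] for customer in ranked[n:]}

# Hypothetical sample data: (customer, sales) pairs
rows = [("A", 100), ("B", 120), ("C", 5000), ("A", 50), ("D", 90), ("E", 7000)]
kept = drop_top_n(rows, n=2)
print(kept)
```

With this sample, customers C and E have by far the largest totals, so they are the two that get dropped.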

jonathandienst
Partner - Champion III

Often you can find outliers by using the Fractile() function. For example, to keep only the middle 90% of values, use something like:

     =If(Sum(Value) > Fractile(Sum(Value), 0.05) And Sum(Value) < Fractile(Sum(Value), 0.95), Sum(Value))
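As a rough illustration of the fractile band outside Qlik, the Python sketch below keeps only the aggregated totals that lie strictly between the 5% and 95% cut-offs. The helper `fractile` uses simple linear interpolation, which is an assumption and may not match how Qlik's Fractile() interpolates. The names and data are hypothetical.

```python
def fractile(values, p):
    """p-quantile using linear interpolation between the sorted values (assumed method)."""
    s = sorted(values)
    pos = p * (len(s) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)

def middle_band(totals, low=0.05, high=0.95):
    """Keep entries whose total lies strictly between the low and high fractiles."""
    lo_cut = fractile(totals.values(), low)
    hi_cut = fractile(totals.values(), high)
    return {k: v for k, v in totals.items() if lo_cut < v < hi_cut}

# Hypothetical per-dimension totals, i.e. what Sum(Value) would return per chart point
totals = {"A": 150, "B": 120, "C": 5000, "D": 90, "E": 7000, "F": 130, "G": 1}
band = middle_band(totals)
print(sorted(band))
```

Note that the cut-offs are computed from the aggregated totals (one value per chart point), not from the individual rows. The cut-offs also depend on the data itself, so a large value such as C can survive when an even larger one is present.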

Logic will get you from a to b. Imagination will take you everywhere. - A Einstein
abhishek_singla
Partner - Contributor II

Hi Gautham Prasad,

How do you define outliers?

I've filtered out the outliers based on the following formula:

By the interquartile range rule (IQR = Q3 - Q1), outliers are the values for which:

1. value > Q3 + 1.5*IQR

2. value < Q1 - 1.5*IQR

See the picture below.

test.PNG

I did this in Qlik Sense. Please find the .qvf file attached.
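
The IQR rule above can be sketched in a few lines of Python for anyone who wants to sanity-check the thresholds outside Qlik. This is only an illustration, not what the attached .qvf does: the function name `iqr_filter` and the sample values are made up, and the quartiles come from Python's `statistics.quantiles` with the "inclusive" method, which may differ slightly from Qlik's Fractile().

```python
import statistics

def iqr_filter(values, k=1.5):
    """Keep values inside [Q1 - k*IQR, Q3 + k*IQR]; everything else is an outlier."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if lower <= v <= upper]

# Hypothetical revenue values with one unusually large point
values = [90, 120, 130, 150, 160, 5000]
print(iqr_filter(values))
```

Because the thresholds are recomputed from whatever values are passed in, the rule adapts to the current selection instead of relying on a hardcoded limit.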