<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Fractile function to delete outliers in Connectivity &amp; Data Prep</title>
    <link>https://community.qlik.com/t5/Connectivity-Data-Prep/Fractile-function-to-delete-outliers/m-p/2120959#M12825</link>
    <description>&lt;P&gt;You need at least 3 steps to exclude such records, for example (simplified):&lt;/P&gt;
&lt;P&gt;t: load * from Source;&lt;BR /&gt;&amp;nbsp; &amp;nbsp; left join(t) load Key, frac(0.01) as Min,&amp;nbsp;frac(0.99) as Max resident t group by Key;&lt;/P&gt;
&lt;P&gt;t2: noconcatenate load * resident t where val &amp;gt;= Min and val &amp;lt;= Max;&lt;BR /&gt;drop tables t; drop fields Min, Max;&lt;/P&gt;
&lt;P&gt;More simple would it be if you would specify valide values - means using fixed values in the condition like:&lt;/P&gt;
&lt;P&gt;...&amp;nbsp;where val &amp;gt;= 0.1 and val &amp;lt;= 99;&lt;/P&gt;
&lt;P&gt;Another simple approach would be not to exclude any records else to apply a valide value check within if-loops to create a flag-field or maybe using class() to cluster the values. Flags and cluster could be then used as selections and/or dimensions and/or set analysis conditions.&lt;/P&gt;</description>
    <pubDate>Thu, 21 Sep 2023 12:47:37 GMT</pubDate>
    <dc:creator>marcus_sommer</dc:creator>
    <dc:date>2023-09-21T12:47:37Z</dc:date>
    <item>
      <title>Fractile function to delete outliers</title>
      <link>https://community.qlik.com/t5/Connectivity-Data-Prep/Fractile-function-to-delete-outliers/m-p/2120926#M12823</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="LeonardoB25_0-1695295352582.png" style="width: 400px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/116744i9B1F062ED65BDC2A/image-size/medium?v=v2&amp;amp;px=400" role="button" title="LeonardoB25_0-1695295352582.png" alt="LeonardoB25_0-1695295352582.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Hello, I'm trying to use the Fractile() function to delete outliers from my table, but I can't manage to make it work. Documentation specifying a general &amp;lt;expr&amp;gt; in the first function clause doesn't help either.&lt;/P&gt;
&lt;P&gt;I have a numeric colum (Valore Riga) and I just want to keep the most central 99% of data, in order to exclude extreme values from the dataset.&lt;/P&gt;
&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Sep 2023 11:26:13 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Connectivity-Data-Prep/Fractile-function-to-delete-outliers/m-p/2120926#M12823</guid>
      <dc:creator>LeonardoB25</dc:creator>
      <dc:date>2023-09-21T11:26:13Z</dc:date>
    </item>
    <item>
      <title>Re: Fractile function to delete outliers</title>
      <link>https://community.qlik.com/t5/Connectivity-Data-Prep/Fractile-function-to-delete-outliers/m-p/2120932#M12824</link>
      <description>&lt;P&gt;It seems like this would be easier to achieve by loading the data, getting the count of rows, and then only loading rows with RowNo() greater than the bottom 0.5% and less than the top 0.5%. Calculating fractiles doesn't seem to be necessary. That said, you could calculate the fractile in a separate load (this requires a GROUP BY) and then use that to filter in a following load. &lt;/P&gt;</description>
      <pubDate>Thu, 21 Sep 2023 11:43:20 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Connectivity-Data-Prep/Fractile-function-to-delete-outliers/m-p/2120932#M12824</guid>
      <dc:creator>Or</dc:creator>
      <dc:date>2023-09-21T11:43:20Z</dc:date>
    </item>
    <item>
      <title>Re: Fractile function to delete outliers</title>
      <link>https://community.qlik.com/t5/Connectivity-Data-Prep/Fractile-function-to-delete-outliers/m-p/2120959#M12825</link>
      <description>&lt;P&gt;You need at least 3 steps to exclude such records, for example (simplified):&lt;/P&gt;
&lt;P&gt;t: load * from Source;&lt;BR /&gt;&amp;nbsp; &amp;nbsp; left join(t) load Key, frac(0.01) as Min,&amp;nbsp;frac(0.99) as Max resident t group by Key;&lt;/P&gt;
&lt;P&gt;t2: noconcatenate load * resident t where val &amp;gt;= Min and val &amp;lt;= Max;&lt;BR /&gt;drop tables t; drop fields Min, Max;&lt;/P&gt;
&lt;P&gt;More simple would it be if you would specify valide values - means using fixed values in the condition like:&lt;/P&gt;
&lt;P&gt;...&amp;nbsp;where val &amp;gt;= 0.1 and val &amp;lt;= 99;&lt;/P&gt;
&lt;P&gt;Another simple approach would be not to exclude any records else to apply a valide value check within if-loops to create a flag-field or maybe using class() to cluster the values. Flags and cluster could be then used as selections and/or dimensions and/or set analysis conditions.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Sep 2023 12:47:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Connectivity-Data-Prep/Fractile-function-to-delete-outliers/m-p/2120959#M12825</guid>
      <dc:creator>marcus_sommer</dc:creator>
      <dc:date>2023-09-21T12:47:37Z</dc:date>
    </item>
  </channel>
</rss>

