<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Doing Data Science-ey stuff (linear regression lines in QlikView) in QlikView</title>
    <link>https://community.qlik.com/t5/QlikView/Doing-Data-Science-ey-stuff-linear-regression-lines-in-QlikView/m-p/1525943#M438410</link>
    <description>&lt;P&gt;&lt;STRONG&gt;Hello Qlik-ers,&lt;/STRONG&gt;&lt;BR /&gt;&lt;BR /&gt;In the mad-rush to get ever more &lt;EM&gt;“scientific”&lt;/EM&gt; about data, exploring Qlik features that fall under this thorough statistical genre have yielded some useful examples. However, some chart settings &amp;amp; combinations still behave in puzzling ways.&lt;BR /&gt;&lt;BR /&gt;Specifically I sought a foundation for plotting linear regressions when the data consists of a variety of X,Y series concatenated to a single table. In this manner, the underlying data model can “bundle” together disparate topics, pulled into visualization focus by applying selection.&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This example combines 3 data topics, each with possible multiple subsets:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;The Anscombe quartet &lt;EM&gt;(a useful series for calibrating linear regression calculations) &lt;/EM&gt;(&lt;A href="https://en.wikipedia.org/wiki/Anscombe%27s_quartet" target="_blank"&gt;https://en.wikipedia.org/wiki/Anscombe%27s_quartet&lt;/A&gt;)&lt;/LI&gt;&lt;LI&gt;Masses for American women as function of height age 30–39(&lt;A href="https://en.wikipedia.org/wiki/Simple_linear_regression" target="_blank"&gt;https://en.wikipedia.org/wiki/Simple_linear_regression&lt;/A&gt;) – &lt;EM&gt;html table @2&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;Randomized X, Y coordinates in sets of 100 points&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Each X,Y data point is tagged with a 2-hierarchy dimension: &lt;EM&gt;{Topic} | {Subset Name}&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;The Anscombe data set was used specifically because it verifies plot lines were correct.&amp;nbsp;&amp;nbsp; The premise is each of the four sets in the quartet produce identical linear regression.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="20190103_QV_Linear_Regression_03_Combo_and_Scatter_w_Trellis_using_Y_expression_trendlines_showing_ANSCOMBE.png" style="width: 744px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/2948i1F1969A2A390D67E/image-size/large?v=v2&amp;amp;px=999" role="button" title="20190103_QV_Linear_Regression_03_Combo_and_Scatter_w_Trellis_using_Y_expression_trendlines_showing_ANSCOMBE.png" alt="20190103_QV_Linear_Regression_03_Combo_and_Scatter_w_Trellis_using_Y_expression_trendlines_showing_ANSCOMBE.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The Combo chart did some interesting things.&amp;nbsp;&amp;nbsp;If two points shared the same value of either identical X,Y value, the chart would collapse these points so by "cheating" the points were given distinction by adding a near insignificant small&amp;nbsp;sub-decimal amount to force them to plot as separate points.&lt;BR /&gt;&lt;BR /&gt;The&amp;nbsp;trendlines on the combo chart also seem to create the linear regression trendlines, but when trellis was engaged,&amp;nbsp;started to perform erractically.&amp;nbsp; Also when the combo chart in trellis form was fed more than 100 data points, it automatically switched over to a line chart &lt;EM&gt;(even when line was not&amp;nbsp;activated and the expression was instructed to only use symbol)&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="20190103_QV_Linear_Regression_02_Combo_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" style="width: 999px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/2949i82BE384992C5F939/image-size/large?v=v2&amp;amp;px=999" role="button" title="20190103_QV_Linear_Regression_02_Combo_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" alt="20190103_QV_Linear_Regression_02_Combo_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" /&gt;&lt;/span&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;Naturally the X,Y scatter seems best choice for generating linear regression, however it wasn’t necessarily intuitive this could only be activated by having the Y expression in focus, whereas trendlines checkboxes are greyed out when the X expression is in focus.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="20190103_QV_Linear_Regression_04_Trendline_settings_on_the_Scatter_chart.png" style="width: 784px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/2947i257EB9B1CCA0CE63/image-size/large?v=v2&amp;amp;px=999" role="button" title="20190103_QV_Linear_Regression_04_Trendline_settings_on_the_Scatter_chart.png" alt="20190103_QV_Linear_Regression_04_Trendline_settings_on_the_Scatter_chart.png" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;Ideally I’d like to take the linear regression expression syntaxes being shown via trendlines and recreate these as reference lines. Is it possible to extract the linear regression calculations pre-packaged to checkboxes, and use these elsewhere? &lt;EM&gt;(i.e. take ‘y=.026998x + 8.632’ and use it as a reference line).&lt;BR /&gt;&lt;BR /&gt;Thanks for any consideration you can give the topic! Appreciate your thoughts &amp;amp; feedback. &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="20190103_QV_Linear_Regression_01_Scatter_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" style="width: 999px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/2950i9D44DF41FA84BAD4/image-size/large?v=v2&amp;amp;px=999" role="button" title="20190103_QV_Linear_Regression_01_Scatter_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" alt="20190103_QV_Linear_Regression_01_Scatter_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" /&gt;&lt;/span&gt;&lt;/EM&gt;&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 21:39:57 GMT</pubDate>
    <dc:creator>evan_kurowski</dc:creator>
    <dc:date>2024-11-16T21:39:57Z</dc:date>
    <item>
      <title>Doing Data Science-ey stuff (linear regression lines in QlikView)</title>
      <link>https://community.qlik.com/t5/QlikView/Doing-Data-Science-ey-stuff-linear-regression-lines-in-QlikView/m-p/1525943#M438410</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Hello Qlik-ers,&lt;/STRONG&gt;&lt;BR /&gt;&lt;BR /&gt;In the mad-rush to get ever more &lt;EM&gt;“scientific”&lt;/EM&gt; about data, exploring Qlik features that fall under this thorough statistical genre have yielded some useful examples. However, some chart settings &amp;amp; combinations still behave in puzzling ways.&lt;BR /&gt;&lt;BR /&gt;Specifically I sought a foundation for plotting linear regressions when the data consists of a variety of X,Y series concatenated to a single table. In this manner, the underlying data model can “bundle” together disparate topics, pulled into visualization focus by applying selection.&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This example combines 3 data topics, each with possible multiple subsets:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;The Anscombe quartet &lt;EM&gt;(a useful series for calibrating linear regression calculations) &lt;/EM&gt;(&lt;A href="https://en.wikipedia.org/wiki/Anscombe%27s_quartet" target="_blank"&gt;https://en.wikipedia.org/wiki/Anscombe%27s_quartet&lt;/A&gt;)&lt;/LI&gt;&lt;LI&gt;Masses for American women as function of height age 30–39(&lt;A href="https://en.wikipedia.org/wiki/Simple_linear_regression" target="_blank"&gt;https://en.wikipedia.org/wiki/Simple_linear_regression&lt;/A&gt;) – &lt;EM&gt;html table @2&lt;/EM&gt;&lt;/LI&gt;&lt;LI&gt;Randomized X, Y coordinates in sets of 100 points&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Each X,Y data point is tagged with a 2-hierarchy dimension: &lt;EM&gt;{Topic} | {Subset Name}&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;The Anscombe data set was used specifically because it verifies plot lines were correct.&amp;nbsp;&amp;nbsp; The premise is each of the four sets in the quartet produce identical linear regression.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="20190103_QV_Linear_Regression_03_Combo_and_Scatter_w_Trellis_using_Y_expression_trendlines_showing_ANSCOMBE.png" style="width: 744px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/2948i1F1969A2A390D67E/image-size/large?v=v2&amp;amp;px=999" role="button" title="20190103_QV_Linear_Regression_03_Combo_and_Scatter_w_Trellis_using_Y_expression_trendlines_showing_ANSCOMBE.png" alt="20190103_QV_Linear_Regression_03_Combo_and_Scatter_w_Trellis_using_Y_expression_trendlines_showing_ANSCOMBE.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The Combo chart did some interesting things.&amp;nbsp;&amp;nbsp;If two points shared the same value of either identical X,Y value, the chart would collapse these points so by "cheating" the points were given distinction by adding a near insignificant small&amp;nbsp;sub-decimal amount to force them to plot as separate points.&lt;BR /&gt;&lt;BR /&gt;The&amp;nbsp;trendlines on the combo chart also seem to create the linear regression trendlines, but when trellis was engaged,&amp;nbsp;started to perform erractically.&amp;nbsp; Also when the combo chart in trellis form was fed more than 100 data points, it automatically switched over to a line chart &lt;EM&gt;(even when line was not&amp;nbsp;activated and the expression was instructed to only use symbol)&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="20190103_QV_Linear_Regression_02_Combo_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" style="width: 999px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/2949i82BE384992C5F939/image-size/large?v=v2&amp;amp;px=999" role="button" title="20190103_QV_Linear_Regression_02_Combo_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" alt="20190103_QV_Linear_Regression_02_Combo_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" /&gt;&lt;/span&gt;&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;Naturally the X,Y scatter seems best choice for generating linear regression, however it wasn’t necessarily intuitive this could only be activated by having the Y expression in focus, whereas trendlines checkboxes are greyed out when the X expression is in focus.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="20190103_QV_Linear_Regression_04_Trendline_settings_on_the_Scatter_chart.png" style="width: 784px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/2947i257EB9B1CCA0CE63/image-size/large?v=v2&amp;amp;px=999" role="button" title="20190103_QV_Linear_Regression_04_Trendline_settings_on_the_Scatter_chart.png" alt="20190103_QV_Linear_Regression_04_Trendline_settings_on_the_Scatter_chart.png" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;Ideally I’d like to take the linear regression expression syntaxes being shown via trendlines and recreate these as reference lines. Is it possible to extract the linear regression calculations pre-packaged to checkboxes, and use these elsewhere? &lt;EM&gt;(i.e. take ‘y=.026998x + 8.632’ and use it as a reference line).&lt;BR /&gt;&lt;BR /&gt;Thanks for any consideration you can give the topic! Appreciate your thoughts &amp;amp; feedback. &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="20190103_QV_Linear_Regression_01_Scatter_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" style="width: 999px;"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/2950i9D44DF41FA84BAD4/image-size/large?v=v2&amp;amp;px=999" role="button" title="20190103_QV_Linear_Regression_01_Scatter_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" alt="20190103_QV_Linear_Regression_01_Scatter_w_Trellis_using_Y_expression_trendlines_showing_RANDOM.png" /&gt;&lt;/span&gt;&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 21:39:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Doing-Data-Science-ey-stuff-linear-regression-lines-in-QlikView/m-p/1525943#M438410</guid>
      <dc:creator>evan_kurowski</dc:creator>
      <dc:date>2024-11-16T21:39:57Z</dc:date>
    </item>
  </channel>
</rss>

