<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Join data duplicating records in QlikView</title>
    <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138440#M1224836</link>
    <description>&lt;P&gt;Client ID, Pol/Cont Num, effective date and Unique ID are all shared fields but it should only be using Unique ID to match. The rest of the columns have potential duplicates so how can I force it to match based only on unique ID?&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/6142"&gt;@Or&lt;/a&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 16 Nov 2023 14:48:49 GMT</pubDate>
    <dc:creator>qlikuser233</dc:creator>
    <dc:date>2023-11-16T14:48:49Z</dc:date>
    <item>
      <title>Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138430#M1224833</link>
      <description>&lt;P&gt;I am joining two data sources, the issue is that I am brining in certain columns for each and they share 4 columns with the primary key being unique identifier. The columns that are missing from Tracker 2 are populating as 0's instead of merging into one line.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;
&lt;DIV&gt;Tracker:&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;LOAD&amp;nbsp; Client,&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;[Client ID],&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;[Team Cd],&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;[Policy/Contract Effective Date],&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;[Policy/Contract Expiration Date],&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;[Pol/Cont Num],&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;[Captive or Non-Captive],&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;[Policy/Contract Effective Date Conversion],&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;Adjustment,&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;[Client ID]&amp;amp;[Pol/Cont Num]&amp;amp;[Policy/Contract Effective Date] as [Unique Identifier],&amp;nbsp;&lt;BR /&gt;[3133]&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;[3137]&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;Outer join (Tracker)&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;Tracker_2:&lt;/DIV&gt;
&lt;DIV&gt;LOAD A as [Client ID],&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;B as [Client],&amp;nbsp;&lt;BR /&gt;&amp;nbsp; &amp;nbsp;C as [Pol/Cont Num]&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;D as [Policy/Contract Effective Date],&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;I as [Unique Identifier],&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&lt;SPAN&gt; Z as [Timing - Waiting for Registration (New/RNW or Adjustment) - Premium - COU, PAU, ACR],&lt;BR /&gt;&lt;BR /&gt;Attached is a photo for one record. It splits into 2 lines, the top line is coming from tracker 2 as it has 0's where it should be filling in the information from Tracker. I have crossed out the unique identifier number but they are identical so I cannot figure out why this is creating a line for each source rather than merging them when the primary key for both sources is identical.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;</description>
      <pubDate>Thu, 16 Nov 2023 15:02:46 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138430#M1224833</guid>
      <dc:creator>qlikuser233</dc:creator>
      <dc:date>2023-11-16T15:02:46Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138432#M1224834</link>
      <description>&lt;P&gt;You marked out the exact parts needed to try and see what's going on, so having to go on a guess here, but... keep in mind that joins are based on &lt;STRONG&gt;all fields with identical names&lt;/STRONG&gt;. There's no concept of "primary key" in a Qlik based join. &lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 14:41:20 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138432#M1224834</guid>
      <dc:creator>Or</dc:creator>
      <dc:date>2023-11-16T14:41:20Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138438#M1224835</link>
      <description>&lt;P&gt;Everything I marked out is identical but I can redo it with sample data:&amp;nbsp;&lt;/P&gt;
&lt;TABLE style="width: 100%;" border="1" width="100%"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="7.142857142857143%" height="47px"&gt;Client ID&lt;/TD&gt;
&lt;TD width="7.142857142857143%" height="47px"&gt;Client&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="47px"&gt;Pol/Cont Num&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="47px"&gt;effective date&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="47px"&gt;Unique ID&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="47px"&gt;3033&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="47px"&gt;3137&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="47px"&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD height="25px"&gt;123&lt;/TD&gt;
&lt;TD height="25px"&gt;Crumbl&lt;/TD&gt;
&lt;TD height="25px"&gt;456&lt;/TD&gt;
&lt;TD height="25px"&gt;999&lt;/TD&gt;
&lt;TD height="25px"&gt;123456999&lt;/TD&gt;
&lt;TD height="25px"&gt;0&lt;/TD&gt;
&lt;TD height="25px"&gt;0&lt;/TD&gt;
&lt;TD height="25px"&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD width="7.142857142857143%" height="25px"&gt;123&lt;/TD&gt;
&lt;TD width="7.142857142857143%" height="25px"&gt;Crumbl&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="25px"&gt;456&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="25px"&gt;999&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="25px"&gt;123456999&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="25px"&gt;1&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="25px"&gt;587&lt;/TD&gt;
&lt;TD width="14.285714285714286%" height="25px"&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&lt;BR /&gt;so 3033 and 3137 are coming from Tracker and I would expect to see one line here but it is creating 2.&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/6142"&gt;@Or&lt;/a&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 15:02:07 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138438#M1224835</guid>
      <dc:creator>qlikuser233</dc:creator>
      <dc:date>2023-11-16T15:02:07Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138440#M1224836</link>
      <description>&lt;P&gt;Client ID, Pol/Cont Num, effective date and Unique ID are all shared fields but it should only be using Unique ID to match. The rest of the columns have potential duplicates so how can I force it to match based only on unique ID?&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/6142"&gt;@Or&lt;/a&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 14:48:49 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138440#M1224836</guid>
      <dc:creator>qlikuser233</dc:creator>
      <dc:date>2023-11-16T14:48:49Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138441#M1224837</link>
      <description>&lt;P&gt;These don't seem to match up - you create the unique ID by concatenating a date, but in this case it's clearly not based on that date? It's also missing at least one of the fields that has an identical name, [Policy/Contract Effective Date], which in your screen capture is showing different values between rows.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 14:50:08 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138441#M1224837</guid>
      <dc:creator>Or</dc:creator>
      <dc:date>2023-11-16T14:50:08Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138443#M1224838</link>
      <description>&lt;P&gt;You would want to change the field names on one side - presumably on the outer side - for anything that isn't supposed to match. Otherwise, I don't see how you'd expect to end up with one row when the field has two potential values. You could add a second run with a Group By to handle the doubled rows in whichever way you want to, though.&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 14:52:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138443#M1224838</guid>
      <dc:creator>Or</dc:creator>
      <dc:date>2023-11-16T14:52:24Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138446#M1224839</link>
      <description>&lt;P&gt;&amp;nbsp;m&lt;/P&gt;</description>
      <pubDate>Thu, 07 Dec 2023 20:07:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138446#M1224839</guid>
      <dc:creator>qlikuser233</dc:creator>
      <dc:date>2023-12-07T20:07:30Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138450#M1224840</link>
      <description>&lt;P&gt;Again, you can't join two tables which have different values in a column and expect anything other than duplication. This is the correct result. If you want to achieve a different result, you need to use a Group By function to deal with the "duplication" and tell Qlik how you want to pick a value between the two options, or rename the columns so they don't share the same name.&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 15:02:01 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138450#M1224840</guid>
      <dc:creator>Or</dc:creator>
      <dc:date>2023-11-16T15:02:01Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138454#M1224841</link>
      <description>&lt;P&gt;You are not looking really on the table-data else on an UI view. This means the extra row might not come from the join-statement else from another association between your data.&lt;/P&gt;
&lt;P&gt;Just comment everything else from your script unless this join-stuff. Further adding a recno() and a rowno() to the load will be quite helpful to track what happens - and then put everything in a table-box and not a table-chart. If it results really in any duplicates it would mean that there are duplicates within the sources and/or that the key-values are exactly matching together. For example identically looking data must not mandatory be equally like a real date and a per date() formatted timestamp ...&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 15:08:50 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138454#M1224841</guid>
      <dc:creator>marcus_sommer</dc:creator>
      <dc:date>2023-11-16T15:08:50Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138457#M1224842</link>
      <description>&lt;P&gt;They don't have different values in the columns. The shared columns in each source contain identical values. The only thing I am trying to do is add columns on to the Tracker table using Unique Identifier. If I rename the columns then that would just create further duplication. Why would it not use the unique ID to pull in info from the other columns?&amp;nbsp;&lt;/P&gt;
&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp;These columns are identical in both&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;&lt;STRONG&gt;A as [Client ID],&lt;/STRONG&gt;&lt;/EM&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;EM&gt;&lt;STRONG&gt;C as [Pol/Cont Num],&lt;/STRONG&gt;&lt;/EM&gt;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;EM&gt;&lt;STRONG&gt;D as [Policy/Contract Effective Date],&lt;/STRONG&gt;&lt;/EM&gt;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp; &amp;nbsp;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;&lt;STRONG&gt;&amp;nbsp;I as [Unique Identifier],&lt;/STRONG&gt;&lt;/EM&gt;&amp;nbsp;&amp;nbsp;&lt;BR /&gt;The only new columns I need from Tracker 2 are from V-AA.&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/6142"&gt;@Or&lt;/a&gt;&amp;nbsp; If I am misunderstanding how would I use group by with the ID to achieve this?&amp;nbsp;&lt;/DIV&gt;</description>
      <pubDate>Thu, 16 Nov 2023 15:11:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138457#M1224842</guid>
      <dc:creator>qlikuser233</dc:creator>
      <dc:date>2023-11-16T15:11:12Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138463#M1224843</link>
      <description>&lt;P&gt;I can see that in the table it is taking my unique ID and breaking it down into both sources so I have a line for each. The first row contains data for a column not in Tracker 2 and vice versa for Tracker. So why is it breaking the data sources out instead of merging them?&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/28038"&gt;@marcus_sommer&lt;/a&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 15:17:24 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138463#M1224843</guid>
      <dc:creator>qlikuser233</dc:creator>
      <dc:date>2023-11-16T15:17:24Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138469#M1224844</link>
      <description>&lt;P&gt;Like&amp;nbsp;&lt;a href="https://community.qlik.com/t5/user/viewprofilepage/user-id/6142"&gt;@Or&lt;/a&gt;&amp;nbsp;mentioned a join is performed against all key-fields which means that all field-values of them must be matching.&lt;/P&gt;
&lt;P&gt;Further missing keys and/or any duplicates will have an impact on the resulting data-set and may adding respectively removing records (depending on the applied join-type). To get rid of it could be quite difficult especially if there is not enough knowledge and control in regard to the data-quality. Therefore I suggest to change the join-logic to a mapping which is often easier, more flexible and performant and without any risk to change the number of records:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://community.qlik.com/t5/Design/Don-t-join-use-Applymap-instead/ba-p/1467592" target="_blank"&gt;Don't join - use Applymap instead - Qlik Community - 1467592&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 15:29:34 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138469#M1224844</guid>
      <dc:creator>marcus_sommer</dc:creator>
      <dc:date>2023-11-16T15:29:34Z</dc:date>
    </item>
    <item>
      <title>Re: Join data duplicating records</title>
      <link>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138474#M1224845</link>
      <description>&lt;P&gt;I have figured out the issue, there was a same column name from both sources one was a calculated field and the other a hard coded input that was different once I removed there was no issue. Thanks everyone&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 15:38:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Join-data-duplicating-records/m-p/2138474#M1224845</guid>
      <dc:creator>qlikuser233</dc:creator>
      <dc:date>2023-11-16T15:38:54Z</dc:date>
    </item>
  </channel>
</rss>

