<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Remove duplicate data in field in QlikView</title>
    <link>https://community.qlik.com/t5/QlikView/Remove-duplicate-data-in-field/m-p/434075#M161800</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi There,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Been battling with this issue for a while and hoping someone can help.&amp;nbsp; I have the below script I'm using to pull data from a spreadsheet.&amp;nbsp; Within this data, there's a "Cache_Reference_Number.&amp;nbsp; What I'm trying to do is ignore rows where the Cache Reference number is duplicated.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;MYDATA:&lt;/P&gt;&lt;P&gt;LOAD Cache_Id, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cache_Crt_Date, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cache_Broker_Code, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cache_Reference_Number, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cache_Incoming&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt; From&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;SPAN style="font-size: 10pt;"&gt; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;C:\Users\Nathan\Documents\ExternalData.xlsx&lt;/P&gt;&lt;P&gt;(ooxml, embedded labels, table is Sheet1);&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;For example, the below are some Cache Reference Numbers from the file.&amp;nbsp; I'd want to ignore all the "&lt;SPAN style="text-align: -webkit-right;"&gt;18312102" rows except for the last one.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;TABLE border="0" cellpadding="0" cellspacing="0" style="width: 183px;"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD align="right" height="20" width="183"&gt;18312100&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;18312102&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;&lt;SPAN style="text-align: -webkit-right;"&gt;18312151&lt;/SPAN&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;18312102&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;18312102&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;18312103&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any help on this would be appreciated&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 08 Apr 2013 21:06:09 GMT</pubDate>
    <dc:creator />
    <dc:date>2013-04-08T21:06:09Z</dc:date>
    <item>
      <title>Remove duplicate data in field</title>
      <link>https://community.qlik.com/t5/QlikView/Remove-duplicate-data-in-field/m-p/434075#M161800</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi There,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Been battling with this issue for a while and hoping someone can help.&amp;nbsp; I have the below script I'm using to pull data from a spreadsheet.&amp;nbsp; Within this data, there's a "Cache_Reference_Number.&amp;nbsp; What I'm trying to do is ignore rows where the Cache Reference number is duplicated.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;MYDATA:&lt;/P&gt;&lt;P&gt;LOAD Cache_Id, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cache_Crt_Date, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cache_Broker_Code, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cache_Reference_Number, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Cache_Incoming&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt; From&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;SPAN style="font-size: 10pt;"&gt; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;C:\Users\Nathan\Documents\ExternalData.xlsx&lt;/P&gt;&lt;P&gt;(ooxml, embedded labels, table is Sheet1);&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;For example, the below are some Cache Reference Numbers from the file.&amp;nbsp; I'd want to ignore all the "&lt;SPAN style="text-align: -webkit-right;"&gt;18312102" rows except for the last one.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;TABLE border="0" cellpadding="0" cellspacing="0" style="width: 183px;"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD align="right" height="20" width="183"&gt;18312100&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;18312102&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;&lt;SPAN style="text-align: -webkit-right;"&gt;18312151&lt;/SPAN&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;18312102&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;18312102&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD align="right" height="20"&gt;18312103&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any help on this would be appreciated&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 08 Apr 2013 21:06:09 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Remove-duplicate-data-in-field/m-p/434075#M161800</guid>
      <dc:creator />
      <dc:date>2013-04-08T21:06:09Z</dc:date>
    </item>
    <item>
      <title>Re: Remove duplicate data in field</title>
      <link>https://community.qlik.com/t5/QlikView/Remove-duplicate-data-in-field/m-p/434076#M161801</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;If you say 'except for the last one' you are referring to the input order of MYDATA or maybe the Cache_Crt_Date, right?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Either way, you should be able to resident load your data with your records ordered descending, and use a where not exists() clause to only load the 'last' reference number. Something along these lines:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;INPUT:&lt;/P&gt;&lt;P&gt;LOAD&amp;nbsp; Cache_Reference_Number as&amp;nbsp; Cache_Reference_Number_Full, &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RecNo() as ID &lt;/P&gt;&lt;P&gt;INLINE [&lt;/P&gt;&lt;P&gt;Cache_Reference_Number&lt;/P&gt;&lt;P&gt;18312100&lt;/P&gt;&lt;P&gt;18312102&lt;/P&gt;&lt;P&gt;18312151&lt;/P&gt;&lt;P&gt;18312102&lt;/P&gt;&lt;P&gt;18312102&lt;/P&gt;&lt;P&gt;18312103&lt;/P&gt;&lt;P&gt;];&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;RESULT:&lt;/P&gt;&lt;P&gt;LOAD&amp;nbsp; Cache_Reference_Number_Full as&amp;nbsp; Cache_Reference_Number,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ID &lt;/P&gt;&lt;P&gt;Resident INPUT &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;where not exists( Cache_Reference_Number,&amp;nbsp; Cache_Reference_Number_Full)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;order by ID desc ;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;drop table INPUT;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 08 Apr 2013 23:24:21 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Remove-duplicate-data-in-field/m-p/434076#M161801</guid>
      <dc:creator>swuehl</dc:creator>
      <dc:date>2013-04-08T23:24:21Z</dc:date>
    </item>
    <item>
      <title>Re: Remove duplicate data in field</title>
      <link>https://community.qlik.com/t5/QlikView/Remove-duplicate-data-in-field/m-p/434077#M161802</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hey there, thanks so much for the response.&amp;nbsp; I ended up doing the below, ordering the Cache_Reference_Number and then using Peek, but I think I prefer your implementation.&amp;nbsp; Thanks for the help.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;NoConcatenate&lt;/P&gt;&lt;P&gt;MYNEWDATA:&lt;/P&gt;&lt;P&gt;Load *&lt;/P&gt;&lt;P&gt;Resident MYDATA Order by Cache_Reference_Number;&lt;/P&gt;&lt;P&gt;Drop table MYDATA;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;NoConcatenate&lt;/P&gt;&lt;P&gt;DuplicatesRemoved:&lt;/P&gt;&lt;P&gt;Load *&lt;/P&gt;&lt;P&gt;Resident MYNEWDATA&lt;/P&gt;&lt;P&gt;Where Peek(Cache_Reference_Number)&amp;lt;&amp;gt;Cache_Reference_Number;&lt;/P&gt;&lt;P&gt;drop table MYNEWDATA;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 09 Apr 2013 06:11:16 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Remove-duplicate-data-in-field/m-p/434077#M161802</guid>
      <dc:creator />
      <dc:date>2013-04-09T06:11:16Z</dc:date>
    </item>
  </channel>
</rss>

