<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Removing duplicate people with many fields in QlikView</title>
    <link>https://community.qlik.com/t5/QlikView/Removing-duplicate-people-with-many-fields/m-p/473605#M482510</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a database with the following information:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Person.LastName&lt;/P&gt;&lt;P&gt;Person.FirstName&lt;/P&gt;&lt;P&gt;Person.BirthDate&lt;/P&gt;&lt;P&gt;Person.IdNumber (which consists of the birthdate followed by four characters at the end, in the form DDMMYY-NNNN)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The case is that all people have all five. Everyone has something in Person.IdNumber, but not always in the standard form, and some people might have their first and last names in the opposite fields. BirthDate is most likely a field that everybody has, but its values might repeat, because different people might have been born on the same day. I'm thinking of doing this in the following way:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Rows are duplicates:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;IF IdNumbers are the same&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;or all of the following apply:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Person.IdNumber is not in the standard form DDMMYY-NNNN because NNNN might be missing&lt;/P&gt;&lt;P&gt;IF BirthDates are the same&lt;/P&gt;&lt;P&gt;IF LastNames are the same&lt;/P&gt;&lt;P&gt;IF FirstNames are the same&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I'm having trouble starting to script this. Could someone help me out?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;All the best,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Mikael&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 07 Aug 2013 14:55:57 GMT</pubDate>
    <dc:creator />
    <dc:date>2013-08-07T14:55:57Z</dc:date>
    <item>
      <title>Removing duplicate people with many fields</title>
      <link>https://community.qlik.com/t5/QlikView/Removing-duplicate-people-with-many-fields/m-p/473605#M482510</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a database with the following information:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Person.LastName&lt;/P&gt;&lt;P&gt;Person.FirstName&lt;/P&gt;&lt;P&gt;Person.BirthDate&lt;/P&gt;&lt;P&gt;Person.IdNumber (which consists of the birthdate followed by four characters at the end, in the form DDMMYY-NNNN)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The case is that all people have all five. Everyone has something in Person.IdNumber, but not always in the standard form, and some people might have their first and last names in the opposite fields. BirthDate is most likely a field that everybody has, but its values might repeat, because different people might have been born on the same day. I'm thinking of doing this in the following way:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Rows are duplicates:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;IF IdNumbers are the same&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;or all of the following apply:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Person.IdNumber is not in the standard form DDMMYY-NNNN because NNNN might be missing&lt;/P&gt;&lt;P&gt;IF BirthDates are the same&lt;/P&gt;&lt;P&gt;IF LastNames are the same&lt;/P&gt;&lt;P&gt;IF FirstNames are the same&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I'm having trouble starting to script this. Could someone help me out?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;All the best,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Mikael&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 07 Aug 2013 14:55:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Removing-duplicate-people-with-many-fields/m-p/473605#M482510</guid>
      <dc:creator />
      <dc:date>2013-08-07T14:55:57Z</dc:date>
    </item>
    <item>
      <title>Re: Removing duplicate people with many fields</title>
      <link>https://community.qlik.com/t5/QlikView/Removing-duplicate-people-with-many-fields/m-p/473606#M482511</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Can you not use a Load distinct in the script for deduplication?&lt;/P&gt;&lt;P&gt;If your data is not standardized, you'll have to do a cleanup as well, for instance formatting the birthdate to a standard date.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 07 Aug 2013 16:17:23 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Removing-duplicate-people-with-many-fields/m-p/473606#M482511</guid>
      <dc:creator />
      <dc:date>2013-08-07T16:17:23Z</dc:date>
    </item>
    <item>
      <title>Re: Removing duplicate people with many fields</title>
      <link>https://community.qlik.com/t5/QlikView/Removing-duplicate-people-with-many-fields/m-p/473607#M482512</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;// Keep a row only if neither its IdNumber nor its BirthDate|FirstName|LastName key has been loaded already.&lt;/P&gt;&lt;P&gt;Load&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;LastName,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;FirstName,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;BirthDate,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;IdNumber,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;BirthDate &amp;amp; '|' &amp;amp; FirstName &amp;amp; '|' &amp;amp; LastName as ID2&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;From SourceFile&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Where not Exists(IdNumber) and not Exists(ID2, BirthDate &amp;amp; '|' &amp;amp; FirstName &amp;amp; '|' &amp;amp; LastName);&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 07 Aug 2013 16:39:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/QlikView/Removing-duplicate-people-with-many-fields/m-p/473607#M482512</guid>
      <dc:creator>hic</dc:creator>
      <dc:date>2013-08-07T16:39:15Z</dc:date>
    </item>
  </channel>
</rss>

