<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: tRecordMatching blocking definition excludes every thing in Data Quality</title>
    <link>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281168#M3607</link>
    <description>Hi xnmo, 
&lt;BR /&gt;Your expectations are correct. You should get some matches if the postal code and street names are the same. 
&lt;BR /&gt;Instead of choosing "exact match", try another algorithm such as q-gram, Levenshtein or Jaro-Winkler. 
&lt;BR /&gt;If you still don't get any matches or possible matches, I would lower the matching threshold in the advanced settings of the component in order to try to get a few matches and understand what happens. 
&lt;BR /&gt;Maybe, you need to trim your postal codes or street names before doing the match. 
&lt;BR /&gt;If nothing works, are you sure that there are matches? Can you exhibit one example and provide a job that shows the problem?</description>
    <pubDate>Mon, 19 Sep 2011 08:39:15 GMT</pubDate>
    <dc:creator>Sebastiao_Qlik</dc:creator>
    <dc:date>2011-09-19T08:39:15Z</dc:date>
    <item>
      <title>tRecordMatching blocking definition excludes every thing</title>
      <link>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281167#M3606</link>
      <description>I have a source that needs cleaning street names. I have a reference source with valid combinations of danish postal codes and street names. A street name can of course occur in multiple postal codes. In a sample input I have 80 rows looking like this: 
&lt;BR /&gt;1000;2200;Rantzausgade 
&lt;BR /&gt;First number is an ID, second is postal code and the last is street name. 
&lt;BR /&gt;In a lookup table (database) i have rows like this: 
&lt;BR /&gt;61; 1057;Holbergsgade 
&lt;BR /&gt;Again the first number is an ID (unrelated to the ID in the sample) 
&lt;BR /&gt;In a first attempt I only specify Key definition Input key and Lookup key as street names (exact match to be replaced with some thing fuzzy later). 
&lt;BR /&gt;As it happens with this particular sample, I get 80 rows in the Matches output and 0 rows in the Possible Matches. The matches includes a few samples where the postal codes are actually different (I have added the lookup postal code to the output) but most of them have matching postal code. 
&lt;BR /&gt;Now I add postal code from input and postal code lookup to the blocking definition - and I get 0 rows in both matches and possible matches. Here I expected that I would only get matches when both street name and postal code join. I supposed the blocking definition would work like in tMatchGroup ... 
&lt;BR /&gt;What am I doing wrong? Thanks! 
&lt;BR /&gt;PS I originally posted this under Open Studio,</description>
      <pubDate>Sat, 16 Nov 2024 12:40:48 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281167#M3606</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T12:40:48Z</dc:date>
    </item>
    <item>
      <title>Re: tRecordMatching blocking definition excludes every thing</title>
      <link>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281168#M3607</link>
      <description>Hi xnmo, 
&lt;BR /&gt;Your expectations are correct. You should get some matches if the postal code and street names are the same. 
&lt;BR /&gt;Instead of choosing "exact match", try another algorithm such as q-gram, Levenshtein or Jaro-Winkler. 
&lt;BR /&gt;If you still don't get any matches or possible matches, I would lower the matching threshold in the advanced settings of the component in order to try to get a few matches and understand what happens. 
&lt;BR /&gt;Maybe, you need to trim your postal codes or street names before doing the match. 
&lt;BR /&gt;If nothing works, are you sure that there are matches? Can you exhibit one example and provide a job that shows the problem?</description>
      <pubDate>Mon, 19 Sep 2011 08:39:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281168#M3607</guid>
      <dc:creator>Sebastiao_Qlik</dc:creator>
      <dc:date>2011-09-19T08:39:15Z</dc:date>
    </item>
    <item>
      <title>Re: tRecordMatching blocking definition excludes every thing</title>
      <link>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281169#M3608</link>
      <description>Hi Sebastiao,&lt;BR /&gt;I tried various setting without any success. I also trimmed without success. I shouldn't matter though since both database and sample file has an Integer field. I tried to export a 1000 line sample from the database, and switched input to this sample file. And that works - I was lucky enough to get 9 matches.&lt;BR /&gt;To me it looks like there is a failure in comparing an integer field from the sample file with an integer field from the database. But it works with integers from two files. Could I be doing something wrong around comparing int with Integer (nah, would give compile errors)? Any ideas how I could dig deeper into this?&lt;BR /&gt;Regards Niels</description>
      <pubDate>Mon, 19 Sep 2011 11:57:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281169#M3608</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-09-19T11:57:26Z</dc:date>
    </item>
    <item>
      <title>Re: tRecordMatching blocking definition excludes every thing</title>
      <link>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281170#M3609</link>
      <description>Hi Niels,&lt;BR /&gt;if it's related to the database type, I would try to convert the integer to a string via the tMap (or tJavaRow) before the entry of the tRecordMatching component. &lt;BR /&gt;In the meanwhile, I suggest that you raise an issue in our bugtracker &lt;A href="http://www.talendforge.org/bugs/" rel="nofollow noopener noreferrer"&gt;http://www.talendforge.org/bugs/&lt;/A&gt; &lt;BR /&gt;Please, write there the information about the version of Talend's product, the database from which you get the lookup data, the database schema.&lt;BR /&gt;If you can give us a sample job and data that reproduces the problem, then it will make our work easier.</description>
      <pubDate>Mon, 19 Sep 2011 13:03:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281170#M3609</guid>
      <dc:creator>Sebastiao_Qlik</dc:creator>
      <dc:date>2011-09-19T13:03:33Z</dc:date>
    </item>
    <item>
      <title>Re: tRecordMatching blocking definition excludes every thing</title>
      <link>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281171#M3610</link>
      <description>Hi Sebastiao,&lt;BR /&gt;I have raised the issue with sample jobs and sample data. It has also occurred to me that I could try putting the input data in a MySql table instead of reading it from a file - which is what I need to do for the "real" job.&lt;BR /&gt;Thanks for  helping!&lt;BR /&gt;Niels</description>
      <pubDate>Tue, 20 Sep 2011 10:54:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281171#M3610</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2011-09-20T10:54:33Z</dc:date>
    </item>
    <item>
      <title>Re: tRecordMatching blocking definition excludes every thing</title>
      <link>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281172#M3611</link>
      <description>Thanks for the issue, we'll have a look at it.
&lt;BR /&gt;The link is 
&lt;A href="http://jira.talendforge.org/browse/TDQ-3609" rel="nofollow noopener noreferrer"&gt;http://jira.talendforge.org/browse/TDQ-3609&lt;/A&gt;</description>
      <pubDate>Tue, 20 Sep 2011 13:34:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Data-Quality/tRecordMatching-blocking-definition-excludes-every-thing/m-p/2281172#M3611</guid>
      <dc:creator>Sebastiao_Qlik</dc:creator>
      <dc:date>2011-09-20T13:34:53Z</dc:date>
    </item>
  </channel>
</rss>

