<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic LIKE operator on same dataset in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/LIKE-operator-on-same-dataset/m-p/2293026#M66045</link>
    <description>&lt;P&gt;Hi all,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Just for an example, consider the following data&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="like.PNG" style="width: 425px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Ltuc.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/138448i872C809978F3E8DF/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Ltuc.png" alt="0683p000009Ltuc.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I want to apply&amp;nbsp;&lt;STRONG&gt;LIKE&lt;/STRONG&gt;&amp;nbsp;operator on NAME and CITY and &lt;STRONG&gt;Equal&lt;/STRONG&gt; operator on STATE and ZIP columns so that I can expect following output :&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Monroe Township, NJ &amp;nbsp;| &amp;nbsp;Monroe Township &amp;nbsp;| &amp;nbsp;NJ &amp;nbsp;| &amp;nbsp;08831 ........ i.e. First occurrence only.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I've tried tFilterRow component but don't know how to apply it for this requirement.Which component or steps or function should I apply to get this result ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks !&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 23 Jun 2017 08:58:38 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2017-06-23T08:58:38Z</dc:date>
    <item>
      <title>LIKE operator on same dataset</title>
      <link>https://community.qlik.com/t5/Talend-Studio/LIKE-operator-on-same-dataset/m-p/2293026#M66045</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Just for an example, consider the following data&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="like.PNG" style="width: 425px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Ltuc.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/138448i872C809978F3E8DF/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Ltuc.png" alt="0683p000009Ltuc.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I want to apply&amp;nbsp;&lt;STRONG&gt;LIKE&lt;/STRONG&gt;&amp;nbsp;operator on NAME and CITY and &lt;STRONG&gt;Equal&lt;/STRONG&gt; operator on STATE and ZIP columns so that I can expect following output :&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Monroe Township, NJ &amp;nbsp;| &amp;nbsp;Monroe Township &amp;nbsp;| &amp;nbsp;NJ &amp;nbsp;| &amp;nbsp;08831 ........ i.e. First occurrence only.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I've tried tFilterRow component but don't know how to apply it for this requirement.Which component or steps or function should I apply to get this result ?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thanks !&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 23 Jun 2017 08:58:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/LIKE-operator-on-same-dataset/m-p/2293026#M66045</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-06-23T08:58:38Z</dc:date>
    </item>
    <item>
      <title>Re: LIKE operator on same dataset</title>
      <link>https://community.qlik.com/t5/Talend-Studio/LIKE-operator-on-same-dataset/m-p/2293027#M66046</link>
      <description>&lt;P&gt;This sounds like a tFuzzyMatch component solution IF you cannot define some logic to make this A LOT quicker and more accurate.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For example, you gave the following.....&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Monroe Township, NJ &amp;nbsp;| &amp;nbsp;Monroe Township &amp;nbsp;| &amp;nbsp;NJ &amp;nbsp;| &lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Now can you say that everything before a comma should match the second column? If so, just use String manipulation. It makes sense to consider applying some pre-processing rules to this before going down the tFUzzyMatch route.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 23 Jun 2017 09:56:34 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/LIKE-operator-on-same-dataset/m-p/2293027#M66046</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-06-23T09:56:34Z</dc:date>
    </item>
    <item>
      <title>Re: LIKE operator on same dataset</title>
      <link>https://community.qlik.com/t5/Talend-Studio/LIKE-operator-on-same-dataset/m-p/2293028#M66047</link>
      <description>No its not the case of everything before a comma.&lt;BR /&gt;tUniqueRow - I can use tUniqueRow, but it is based on only equality condition and not the LIKE kind of thing.&lt;BR /&gt;Same in the case of tAggregateRow.&lt;BR /&gt;&lt;BR /&gt;I simply want to group the table based on like condition instead of equality condition.&lt;BR /&gt;&lt;BR /&gt;Can we achieve it using tMap. I'm gonna give a try to it.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 23 Jun 2017 10:14:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/LIKE-operator-on-same-dataset/m-p/2293028#M66047</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-06-23T10:14:59Z</dc:date>
    </item>
    <item>
      <title>Re: LIKE operator on same dataset</title>
      <link>https://community.qlik.com/t5/Talend-Studio/LIKE-operator-on-same-dataset/m-p/2293029#M66048</link>
      <description>&lt;P&gt;The problem you have is that you are assuming a "world knowledge" of a human. Computers can't work like that (here is where my AI degree comes into play &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; ).&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Consider the numbers 1 and 11. They are not "like" each other to us or a computer....they are very different. Now consider 1111111111111111111 and 11111111111111111111. To us (on first inspection) they look "like" each other...until we actually count the 1s. To a computer, they are different. They are massively different. Now if we change numbers to text, our brains automatically spot patterns. So the following text is seen as "the same".....&lt;BR /&gt;&lt;BR /&gt;Hello my name is Richard&lt;/P&gt; 
&lt;P&gt;Hello my name si Richard&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;We autocorrect (which is both good and bad), a computer won't. To a computer that is just a series of bits without a context. That is why "Like" is such a difficult task.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;It has been solved by many mechanisms, but they are not always very efficient or easy to implement in Data Integration. What I was suggesting was that you look for rules to apply. For example, if you make the Strings uppercase, remove leading and trailing spaces, etc. Once you have done that, then you *might* be able to use Java String functionality like "indexOf" (&lt;A href="https://docs.oracle.com/javase/7/docs/api/java/lang/String.html" target="_blank" rel="nofollow noopener noreferrer"&gt;https://docs.oracle.com/javase/7/docs/api/java/lang/String.html&lt;/A&gt;).&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;However, if you cannot apply these rules you may have to use Fuzzy Matching. This is a clever mechanism, but requires a lot of work to get it perfect...if you can get it "perfect" at all.&lt;/P&gt;</description>
      <pubDate>Fri, 23 Jun 2017 10:38:41 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/LIKE-operator-on-same-dataset/m-p/2293029#M66048</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-06-23T10:38:41Z</dc:date>
    </item>
  </channel>
</rss>

