<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Need recommendation on best way to split dataset into two - one with repeating records and another with non repeating records in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232008#M22117</link>
    <description>&lt;P&gt;I tried, but it looks like I am stuck. With tUniqRow, I can split my dataset into unique and repeating rows. However, the tMap component won't let me compare the two. For some reason, I can't seem to add both of them as inputs to the tMap. How do I do that? Unless I can compare the two splits, I won't be able to move the data around so that all repeating records end up in one segment and non-repeating records in another.&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/0053p000007LKj7AAG"&gt;@TRF&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 24 Aug 2018 15:46:08 GMT</pubDate>
    <dc:creator>sushantV</dc:creator>
    <dc:date>2018-08-24T15:46:08Z</dc:date>
    <item>
      <title>Need recommendation on best way to split dataset into two - one with repeating records and another with non repeating records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232006#M22115</link>
      <description>&lt;P&gt;I need to figure out a way to split a dataset into two datasets, based on whether the value in one of the fields repeats. If a value is repeated, all rows with that value should go to one dataset; if it's unique, that row should go to the other.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;e.g. if I have the dataset with following records:&lt;/P&gt; 
&lt;P&gt;A1&lt;/P&gt; 
&lt;P&gt;A1&lt;/P&gt; 
&lt;P&gt;A2&lt;/P&gt; 
&lt;P&gt;A3&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;For the above dataset, the repeating records need to go to dataset 1, so that dataset 1 contains&lt;/P&gt; 
&lt;P&gt;A1&lt;/P&gt; 
&lt;P&gt;A1&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;and the non-repeating values need to go to another dataset, so that dataset 2 contains&lt;/P&gt; 
&lt;P&gt;A2&lt;/P&gt; 
&lt;P&gt;A3&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;If I use tUniqRow, A1 would still go to the resulting unique dataset. I could build the unique dataset first, then compare it with the remaining records, add each matching row to the other dataset, and remove it from the unique dataset. This is messy. Can anyone make a recommendation?&lt;/P&gt;</description>
      <pubDate>Wed, 22 Aug 2018 20:43:30 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232006#M22115</guid>
      <dc:creator>sushantV</dc:creator>
      <dc:date>2018-08-22T20:43:30Z</dc:date>
    </item>
    <item>
      <title>Re: Need recommendation on best way to split dataset into two - one with repeating records and another with non repeating records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232007#M22116</link>
      <description>You got it: use tUniqRow + tMap to remove the rows that also exist as duplicates.</description>
      <pubDate>Wed, 22 Aug 2018 21:27:57 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232007#M22116</guid>
      <dc:creator>TRF</dc:creator>
      <dc:date>2018-08-22T21:27:57Z</dc:date>
    </item>
    <item>
      <title>Re: Need recommendation on best way to split dataset into two - one with repeating records and another with non repeating records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232008#M22117</link>
      <description>&lt;P&gt;I tried, but it looks like I am stuck. With tUniqRow, I can split my dataset into unique and repeating rows. However, the tMap component won't let me compare the two. For some reason, I can't seem to add both of them as inputs to the tMap. How do I do that? Unless I can compare the two splits, I won't be able to move the data around so that all repeating records end up in one segment and non-repeating records in another.&amp;nbsp;&lt;A href="https://community.qlik.com/s/profile/0053p000007LKj7AAG"&gt;@TRF&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 24 Aug 2018 15:46:08 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232008#M22117</guid>
      <dc:creator>sushantV</dc:creator>
      <dc:date>2018-08-24T15:46:08Z</dc:date>
    </item>
    <item>
      <title>Re: Need recommendation on best way to split dataset into two - one with repeating records and another with non repeating records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232009#M22118</link>
      <description>After tUniqRow, catch the unique rows in a tHashOutput (here called uniq) and the duplicate rows in another tHashOutput (here called replicate).
&lt;BR /&gt;In the next subjob, use a tMap with the "uniq" tHashInput as the main input and "replicate" as the lookup.
&lt;BR /&gt;Join both using an inner join and add 2 outputs to the tMap: one catches the rows rejected by the inner join (the truly unique rows) and the other does not catch them.
&lt;BR /&gt;Redirect the 1st output wherever you want, and the 2nd output to a tHashOutput linked to "replicate" (append mode is required).
&lt;BR /&gt;This new tHashOutput should contain the subset of replicated records.
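&lt;BR /&gt;For reference, the end result these steps aim for can be sketched outside Talend. The snippet below is plain Python, not Talend component code; split_by_repetition and its key parameter are hypothetical names used only for illustration. It counts how often each key value occurs, then sends rows whose value occurs exactly once to one list and all remaining rows to the other.

```python
from collections import Counter


def split_by_repetition(rows, key=lambda row: row):
    """Return (repeating, non_repeating), preserving input order."""
    # Count how many times each key value appears in the whole dataset.
    counts = Counter(key(row) for row in rows)
    # Rows whose key appears more than once all go to the repeating set.
    repeating = [row for row in rows if counts[key(row)] != 1]
    # Rows whose key appears exactly once go to the non-repeating set.
    non_repeating = [row for row in rows if counts[key(row)] == 1]
    return repeating, non_repeating


repeating, non_repeating = split_by_repetition(["A1", "A1", "A2", "A3"])
print(repeating)      # ['A1', 'A1']
print(non_repeating)  # ['A2', 'A3']
```

&lt;BR /&gt;Counting first and filtering afterwards keeps every copy of a repeated value together, which a plain unique/duplicate split does not do: such a split leaves the first A1 in the unique set.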
&lt;BR /&gt;Hope you got it.</description>
      <pubDate>Fri, 24 Aug 2018 17:21:33 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232009#M22118</guid>
      <dc:creator>TRF</dc:creator>
      <dc:date>2018-08-24T17:21:33Z</dc:date>
    </item>
    <item>
      <title>Re: Need recommendation on best way to split dataset into two - one with repeating records and another with non repeating records</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232010#M22119</link>
      <description>&lt;P&gt;I am sure I got lost somewhere in the middle. Let me do some reading before attempting this solution.&lt;/P&gt;</description>
      <pubDate>Fri, 24 Aug 2018 17:47:08 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Need-recommendation-on-best-way-to-split-dataset-into-two-one/m-p/2232010#M22119</guid>
      <dc:creator>sushantV</dc:creator>
      <dc:date>2018-08-24T17:47:08Z</dc:date>
    </item>
  </channel>
</rss>

