<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Data duplication handling - scenario in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252496#M36124</link>
    <description>&lt;P&gt;Have you tried using the tUniqueRow component before tMap?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I think what you need is something like this:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 902px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2S1.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/140016i764F5B6ACEF841D4/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2S1.png" alt="0683p000009M2S1.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;The desired output should be something like this&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 08 Feb 2019 13:06:36 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2019-02-08T13:06:36Z</dc:date>
    <item>
      <title>Data duplication handling - scenario</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252493#M36121</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Can I get a solution for the scenario below?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am getting two records for a single product. The only difference between them is one column value, but I need both values loaded into the target in different columns, so removing duplicates won't help me.&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;It would be great to get an idea of the optimal solution, because a huge number of records comes from the source.&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;Talend version: Open Studio for Big Data 7.0.1&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt; 
&lt;P&gt;&lt;EM&gt;&lt;STRONG&gt;Target: Salesforce&lt;/STRONG&gt;&lt;/EM&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;source:&lt;/P&gt; 
&lt;TABLE&gt; 
 &lt;TBODY&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;&lt;STRONG&gt;ProductName&lt;/STRONG&gt;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;&lt;STRONG&gt;ImageType&lt;/STRONG&gt;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;&lt;STRONG&gt;imageurl&lt;/STRONG&gt;&lt;/TD&gt; 
  &lt;/TR&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;Nokia 610&amp;nbsp;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;LARGE&lt;/TD&gt; 
   &lt;TD&gt;/devices/generic-phone.png&lt;/TD&gt; 
  &lt;/TR&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;Nokia 610&amp;nbsp;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;SMALL&lt;/TD&gt; 
   &lt;TD&gt;/devices/5145.jpg&lt;/TD&gt; 
  &lt;/TR&gt; 
 &lt;/TBODY&gt; 
&lt;/TABLE&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT color="#ff0000"&gt;&lt;STRONG&gt;Required data on target (based on 'ImageType' above):&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;TABLE&gt; 
 &lt;TBODY&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;&lt;FONT color="#0000ff"&gt;&lt;STRONG&gt;ProductName&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/TD&gt; 
   &lt;TD&gt;&lt;FONT color="#0000ff"&gt;&lt;STRONG&gt;main_image_URL&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/TD&gt; 
   &lt;TD&gt;&lt;FONT color="#0000ff"&gt;&lt;STRONG&gt;thumbnail_image_URL&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/TD&gt; 
  &lt;/TR&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;&lt;FONT color="#0000ff"&gt;Nokia 610&amp;nbsp;&lt;/FONT&gt;&lt;/TD&gt; 
   &lt;TD&gt;&lt;FONT color="#0000ff"&gt;/devices/generic-phone.png&lt;/FONT&gt;&lt;/TD&gt; 
   &lt;TD&gt;&lt;FONT color="#0000ff"&gt;/devices/5145.jpg&lt;/FONT&gt;&lt;/TD&gt; 
  &lt;/TR&gt; 
 &lt;/TBODY&gt; 
&lt;/TABLE&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I tried writing an expression in tMap, but the output is not as expected (see below): it creates duplicate rows in the target.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;TABLE&gt; 
 &lt;TBODY&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;&lt;STRONG&gt;ProductName&lt;/STRONG&gt;&lt;/TD&gt; 
   &lt;TD&gt;&lt;STRONG&gt;main_image&lt;/STRONG&gt;&lt;/TD&gt; 
   &lt;TD&gt;&lt;STRONG&gt;thumbnail_image&lt;/STRONG&gt;&lt;/TD&gt; 
  &lt;/TR&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;Nokia 610&lt;/TD&gt;
   &lt;TD&gt;/devices/generic-phone.png&lt;/TD&gt;
   &lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
  &lt;/TR&gt;
  &lt;TR&gt;
   &lt;TD&gt;Nokia 610&lt;/TD&gt;
   &lt;TD&gt;&amp;nbsp;&lt;/TD&gt; 
   &lt;TD&gt;/devices/5145.jpg&lt;/TD&gt; 
  &lt;/TR&gt; 
 &lt;/TBODY&gt; 
&lt;/TABLE&gt;</description>
      <pubDate>Sat, 16 Nov 2024 06:38:21 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252493#M36121</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T06:38:21Z</dc:date>
    </item>
    <item>
      <title>Re: Data duplication handling - scenario</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252494#M36122</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Why don't you split the data into two datasets at the beginning and then do an inner join?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Dataset one: where ImageType = "LARGE"&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;TABLE&gt; 
 &lt;TBODY&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;&lt;STRONG&gt;ProductName&lt;/STRONG&gt;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;&lt;STRONG&gt;ImageType&lt;/STRONG&gt;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;&lt;STRONG&gt;imageurl&lt;/STRONG&gt;&lt;/TD&gt; 
  &lt;/TR&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;Nokia 610&amp;nbsp;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;LARGE&lt;/TD&gt; 
   &lt;TD&gt;/devices/generic-phone.png&lt;/TD&gt; 
  &lt;/TR&gt; 
 &lt;/TBODY&gt; 
&lt;/TABLE&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Dataset two: where ImageType = "SMALL"&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;TABLE&gt; 
 &lt;TBODY&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;&lt;STRONG&gt;ProductName&lt;/STRONG&gt;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;&lt;STRONG&gt;ImageType&lt;/STRONG&gt;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;&lt;STRONG&gt;imageurl&lt;/STRONG&gt;&lt;/TD&gt; 
  &lt;/TR&gt; 
  &lt;TR&gt; 
   &lt;TD&gt;Nokia 610&amp;nbsp;&lt;/TD&gt; 
   &lt;TD&gt;&amp;nbsp;SMALL&lt;/TD&gt; 
   &lt;TD&gt;/devices/5145.jpg&lt;/TD&gt; 
  &lt;/TR&gt; 
 &lt;/TBODY&gt; 
&lt;/TABLE&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Now do an inner join on ProductName and map the two imageurl values to the output flow in tMap as separate columns.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Mapping in the tMap&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;STRONG&gt;ProductName -&amp;gt;&amp;nbsp;ProductName&lt;/STRONG&gt;&lt;/P&gt; 
&lt;P&gt;&lt;STRONG&gt;imageurl (SMALL) -&amp;gt;&amp;nbsp;&lt;FONT color="#0000FF"&gt;thumbnail_image_URL&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;imageurl (LARGE) -&amp;gt;&amp;nbsp;&lt;FONT color="#0000FF"&gt;main_image_URL&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;FONT color="#000000"&gt;This should give the desired output.&lt;/FONT&gt;&lt;/P&gt; 
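&lt;P&gt;As a rough illustration of the split-and-join idea above, here is a minimal plain-Java sketch. It is not Talend-generated code. The class name ImageUrlJoin, the method joinLargeWithSmall and the String[] row layout {ProductName, ImageType, imageurl} are assumptions made for this example; in a real job the two filtered flows and the tMap lookup would do this work.&lt;/P&gt;
<![CDATA[<PRE>
```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Talend.
class ImageUrlJoin {

    /**
     * Input rows:  {ProductName, ImageType, imageurl}
     * Output rows: {ProductName, main_image_URL, thumbnail_image_URL}
     */
    static List<String[]> joinLargeWithSmall(List<String[]> sourceRows) {
        // Dataset two: ImageType = "SMALL", indexed by ProductName (the lookup side).
        Map<String, String> smallUrlByProduct = new HashMap<>();
        for (String[] row : sourceRows) {
            if ("SMALL".equals(row[1].trim())) {
                smallUrlByProduct.put(row[0].trim(), row[2]);
            }
        }

        // Dataset one: ImageType = "LARGE" (the main side).
        // Inner join: keep a product only if it appears in both datasets.
        List<String[]> targetRows = new ArrayList<>();
        for (String[] row : sourceRows) {
            String product = row[0].trim();
            if ("LARGE".equals(row[1].trim()) && smallUrlByProduct.containsKey(product)) {
                targetRows.add(new String[] {product, row[2], smallUrlByProduct.get(product)});
            }
        }
        return targetRows;
    }

    public static void main(String[] args) {
        List<String[]> source = Arrays.asList(
            new String[] {"Nokia 610", "LARGE", "/devices/generic-phone.png"},
            new String[] {"Nokia 610", "SMALL", "/devices/5145.jpg"});

        for (String[] row : joinLargeWithSmall(source)) {
            System.out.println(String.join(" | ", row));
        }
    }
}
```
</PRE>]]>
&lt;P&gt;With the two sample rows from the question, this prints a single row: Nokia 610 | /devices/generic-phone.png | /devices/5145.jpg. Products that have only one of the two image types are dropped by the inner join, so a left outer join may be the better choice if those products must still reach the target.&lt;/P&gt;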
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Warm Regards,&lt;BR /&gt;Nikhil Thampi&lt;/P&gt; 
&lt;P&gt;Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 08 Feb 2019 12:39:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252494#M36122</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-08T12:39:59Z</dc:date>
    </item>
    <item>
      <title>Re: Data duplication handling - scenario</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252495#M36123</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LLzpAAG"&gt;@Vibin_CT&lt;/A&gt;, check the job below.&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Untitled.png" style="width: 866px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M1wP.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/139444i86B0161677082CF4/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M1wP.png" alt="0683p000009M1wP.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Untitled.png" style="width: 878px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M30b.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/129391iDAC46EB26262726F/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M30b.png" alt="0683p000009M30b.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Untitled.png" style="width: 871px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M30g.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156553i5A9C8F1B969F9239/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M30g.png" alt="0683p000009M30g.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Untitled.png" style="width: 999px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2wV.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/144228i689F87E745F0555E/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2wV.png" alt="0683p000009M2wV.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 08 Feb 2019 12:50:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252495#M36123</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2019-02-08T12:50:29Z</dc:date>
    </item>
    <item>
      <title>Re: Data duplication handling - scenario</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252496#M36124</link>
      <description>&lt;P&gt;Have you tried using the tUniqueRow component before tMap?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I think what you need is something like this:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 902px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M2S1.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/140016i764F5B6ACEF841D4/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M2S1.png" alt="0683p000009M2S1.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;The desired output should be something like this&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 08 Feb 2019 13:06:36 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252496#M36124</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-08T13:06:36Z</dc:date>
    </item>
    <item>
      <title>Re: Data duplication handling - scenario</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252497#M36125</link>
      <description>&lt;P&gt;Hi &lt;A href="https://community.qlik.com/s/profile/00539000006eKOgAAM"&gt;@nthampi&lt;/A&gt; ,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thank you very much for your solution.&lt;/P&gt; 
&lt;P&gt;To use your method, I would need to load the data into an intermediate database table, because the data I described does not come directly from the source: it is intermediate data produced after many transformations. I was unable to load it into the MySQL staging database because of MySQL table size limits (some columns contain appended data and are very large), so I am using the &lt;STRONG&gt;tHashOutput&lt;/STRONG&gt; component.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am also considering &lt;STRONG&gt;tFileOutputDelimited&lt;/STRONG&gt; instead of &lt;STRONG&gt;tHashOutput&lt;/STRONG&gt; because of the huge number of records and the record size. &lt;FONT color="#FF6600"&gt;Can you suggest which component is better in terms of memory and performance?&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 11 Feb 2019 09:20:00 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252497#M36125</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-11T09:20:00Z</dc:date>
    </item>
    <item>
      <title>Re: Data duplication handling - scenario</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252498#M36126</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Considering your use case, park the data in an interim file using tFileOutputDelimited (and read it back with tFileInputDelimited).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Also increase the job's memory parameters (Xms and Xmx) for better performance.&lt;/P&gt;
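&lt;P&gt;To confirm that a changed heap setting is actually picked up, a small check like the one below can help. It is only a sketch: the class name HeapCheck and the method maxHeapMb are made up for this example, and JVM arguments such as -Xms1024M -Xmx4096M are illustrative values, not a recommendation for this job.&lt;/P&gt;
<![CDATA[<PRE>
```java
// Hypothetical helper for illustration only; not part of Talend.
class HeapCheck {

    /** Returns the maximum heap the JVM will attempt to use, in megabytes. */
    static long maxHeapMb() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        // Run with e.g. "java -Xms1024M -Xmx4096M HeapCheck" to see the effect of -Xmx.
        System.out.println("Max heap (MB): " + maxHeapMb());
    }
}
```
</PRE>]]>
&lt;P&gt;The reported value tracks -Xmx (it can be slightly lower than the configured figure, depending on the JVM), so it gives a quick sanity check before running the full load.&lt;/P&gt;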
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Warm Regards,&lt;BR /&gt;Nikhil Thampi&lt;/P&gt;</description>
      <pubDate>Mon, 11 Feb 2019 09:23:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252498#M36126</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-11T09:23:45Z</dc:date>
    </item>
    <item>
      <title>Re: Data duplication handling - scenario</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252499#M36127</link>
      <description>&lt;P&gt;Thanks &lt;A href="https://community.qlik.com/s/profile/00539000006eKOgAAM"&gt;@nthampi&lt;/A&gt; !!&lt;/P&gt;</description>
      <pubDate>Mon, 11 Feb 2019 10:35:12 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Data-duplication-handling-scenario/m-p/2252499#M36127</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-02-11T10:35:12Z</dc:date>
    </item>
  </channel>
</rss>

