<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Duplicate and Rejected Records - Is there a better way? in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-and-Rejected-Records-Is-there-a-better-way/m-p/2244659#M30736</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;tMap should not be the default choice for tasks that can be achieved in other ways: it is a complex component with a great many options, and that flexibility comes at a cost in performance.&lt;/P&gt;&lt;P&gt;You can use tJavaFlex in place of tMap if the only requirement is to reshape the data flow for the DB output component.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Chandra Kant&lt;/P&gt;</description>
    <pubDate>Wed, 14 Nov 2018 11:58:29 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2018-11-14T11:58:29Z</dc:date>
    <item>
      <title>Duplicate and Rejected Records - Is there a better way?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-and-Rejected-Records-Is-there-a-better-way/m-p/2244658#M30735</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I have built a process flow that extracts data from a CSV file, with all fields brought in as strings.&lt;/P&gt; 
&lt;P&gt;The rows run through a tConvertType, where certain fields are converted to Integers. Rows that fail the conversion are filtered through a tMap and inserted into the Reject table.&amp;nbsp;&lt;SPAN&gt;If there are any rejected&amp;nbsp;records, an email is sent.&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;Duplicates are then caught using tUniqRow, and any duplicate records are filtered through a tMap and inserted into the Reject table.&amp;nbsp;&lt;SPAN&gt;If there are any duplicate records, an email is sent.&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;All valid records are then inserted into the output table.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Although this process works, I am sure it is not the most efficient way to process this data. Can anyone suggest how to clean this up, and perhaps remove some unnecessary components or steps?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="ExtractLoad.png" style="width: 934px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009M14v.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156183i5B15ED61187626DD/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009M14v.png" alt="0683p000009M14v.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 07:18:44 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Duplicate-and-Rejected-Records-Is-there-a-better-way/m-p/2244658#M30735</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T07:18:44Z</dc:date>
    </item>
    <item>
      <title>Re: Duplicate and Rejected Records - Is there a better way?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-and-Rejected-Records-Is-there-a-better-way/m-p/2244659#M30736</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;tMap should not be the default choice for tasks that can be achieved in other ways: it is a complex component with a great many options, and that flexibility comes at a cost in performance.&lt;/P&gt;&lt;P&gt;You can use tJavaFlex in place of tMap if the only requirement is to reshape the data flow for the DB output component.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Chandra Kant&lt;/P&gt;</description>
      <pubDate>Wed, 14 Nov 2018 11:58:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Duplicate-and-Rejected-Records-Is-there-a-better-way/m-p/2244659#M30736</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2018-11-14T11:58:29Z</dc:date>
    </item>
    <item>
      <title>Re: Duplicate and Rejected Records - Is there a better way?</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Duplicate-and-Rejected-Records-Is-there-a-better-way/m-p/2244660#M30737</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can avoid using tMap here, since you are not doing any filtering or expression checks. Instead, you can use a tJavaRow component to process the data.&lt;/P&gt;
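&lt;P&gt;As a rough illustration only, the routing described in the question (convert, reject on failure, catch duplicates, pass the rest through) can be sketched in one pass over the rows. This is standalone Python, not Talend code; in a Talend job the same checks would be written in Java. The function name route_rows, the column names id and qty, and the use of id as the duplicate key are all assumptions made for the example.&lt;/P&gt;

```python
def route_rows(rows, int_fields, key_field):
    """Split rows into (valid, rejected, duplicates) in a single pass.

    rows: list of dicts whose values are all strings, as read from the CSV
    int_fields: names of the columns that must convert to integers
    key_field: column used to detect duplicates (an assumption for this sketch)
    """
    valid, rejected, duplicates = [], [], []
    seen = set()
    for row in rows:
        converted = dict(row)
        try:
            for name in int_fields:
                converted[name] = int(row[name])
        except ValueError:
            # Conversion failed: keep the original strings for the Reject table
            rejected.append(row)
            continue
        key = converted[key_field]
        if key in seen:
            # Key already seen on an earlier valid row: treat as a duplicate
            duplicates.append(converted)
        else:
            seen.add(key)
            valid.append(converted)
    return valid, rejected, duplicates


# Hypothetical sample data standing in for the CSV input
rows = [
    {"id": "1", "qty": "10"},
    {"id": "2", "qty": "abc"},
    {"id": "1", "qty": "7"},
    {"id": "3", "qty": "5"},
]
valid, rejected, duplicates = route_rows(rows, ["id", "qty"], "id")
```

&lt;P&gt;With the sample rows above, the second row fails the integer conversion and the third repeats an id that has already been seen, so each lands in its own list. An email step would then only need to check whether those lists are non-empty.&lt;/P&gt;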
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards&lt;/P&gt;
&lt;P&gt;Ganshyam Patel&lt;/P&gt;</description>
      <pubDate>Fri, 16 Nov 2018 12:02:15 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Duplicate-and-Rejected-Records-Is-there-a-better-way/m-p/2244660#M30737</guid>
      <dc:creator>Ganshyam</dc:creator>
      <dc:date>2018-11-16T12:02:15Z</dc:date>
    </item>
  </channel>
</rss>

