<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Performance Issue in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Performance-Issue/m-p/2230783#M21312</link>
    <description>I have an issue with the design below. It consumes a huge amount of resources. How can the performance be improved? Kindly help me. 
&lt;BR /&gt; 
&lt;A href="https://community.talend.com/legacyfs/online/membersTempo/349921/blob_20160216-1312.png" target="_blank"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MDId.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/141390i43186438DF8A4DC3/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MDId.png" alt="0683p000009MDId.png" /&gt;&lt;/span&gt; &lt;/A&gt; 
&lt;BR /&gt;&amp;gt;&amp;gt;I could remove tmap3 and tmap4, but I kept them to avoid carrying unwanted columns into the buffer. I have also used some filter conditions in those tMaps. 
&lt;BR /&gt;I have 8 columns: A, B, C, D, E, F, G, H. I filter the records on C, D, E, F, G (tmap3 &amp;amp; tmap4) and pass only A, B, H to the reference buffer (tmap1 &amp;amp; tmap2). Is this the right approach? Please correct me if I'm wrong. 
&lt;BR /&gt;&amp;gt;&amp;gt;tFileInputDelimited_2 &amp;amp; tFileInputDelimited_3 read the same file. If I read it through a single component (one tFileDelimited), I cannot use it as a reference in two places. Is there any approach to handle this? Reading the file only once should improve performance. 
&lt;BR /&gt;&amp;gt;&amp;gt;The job is very resource-consuming. I'm getting 3 million records from the source &amp;amp; 2.5 million from each reference. I allocated Xmx16384. I cannot allocate this much RAM to a single job. I need help with this. 
&lt;BR /&gt;&amp;gt;&amp;gt;Sorting and removing duplicates take a long time. I used the sort-on-disk option, but it is still very resource-consuming. Are there other ways to do it efficiently? 
&lt;BR /&gt;Could someone please help me out? 
&lt;BR /&gt;Thanks</description>
    <pubDate>Tue, 01 Mar 2016 16:23:08 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2016-03-01T16:23:08Z</dc:date>
    <item>
      <title>Performance Issue</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Performance-Issue/m-p/2230783#M21312</link>
      <description>I have an issue with the design below. It consumes a huge amount of resources. How can the performance be improved? Kindly help me. 
&lt;BR /&gt; 
&lt;A href="https://community.talend.com/legacyfs/online/membersTempo/349921/blob_20160216-1312.png" target="_blank"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MDId.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/141390i43186438DF8A4DC3/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MDId.png" alt="0683p000009MDId.png" /&gt;&lt;/span&gt; &lt;/A&gt; 
&lt;BR /&gt;&amp;gt;&amp;gt;I could remove tmap3 and tmap4, but I kept them to avoid carrying unwanted columns into the buffer. I have also used some filter conditions in those tMaps. 
&lt;BR /&gt;I have 8 columns: A, B, C, D, E, F, G, H. I filter the records on C, D, E, F, G (tmap3 &amp;amp; tmap4) and pass only A, B, H to the reference buffer (tmap1 &amp;amp; tmap2). Is this the right approach? Please correct me if I'm wrong. 
&lt;BR /&gt;&amp;gt;&amp;gt;tFileInputDelimited_2 &amp;amp; tFileInputDelimited_3 read the same file. If I read it through a single component (one tFileDelimited), I cannot use it as a reference in two places. Is there any approach to handle this? Reading the file only once should improve performance. 
&lt;BR /&gt;&amp;gt;&amp;gt;The job is very resource-consuming. I'm getting 3 million records from the source &amp;amp; 2.5 million from each reference. I allocated Xmx16384. I cannot allocate this much RAM to a single job. I need help with this. 
&lt;BR /&gt;&amp;gt;&amp;gt;Sorting and removing duplicates take a long time. I used the sort-on-disk option, but it is still very resource-consuming. Are there other ways to do it efficiently? 
&lt;BR /&gt;Could someone please help me out? 
&lt;BR /&gt;Thanks</description>
      <pubDate>Tue, 01 Mar 2016 16:23:08 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Performance-Issue/m-p/2230783#M21312</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-03-01T16:23:08Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Issue</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Performance-Issue/m-p/2230784#M21313</link>
      <description>Hi, 
&lt;BR /&gt;We have replied to your other topic: 
&lt;A href="https://community.qlik.com/s/feed/0D53p00007vCnbtCAC" target="_blank" rel="nofollow noopener noreferrer"&gt;https://community.talend.com/t5/Design-and-Development/Performance-issue-with-below-design/td-p/89491&lt;/A&gt;. 
&lt;BR /&gt;Could you please take a look at it? 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
      <pubDate>Wed, 02 Mar 2016 03:49:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Performance-Issue/m-p/2230784#M21313</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2016-03-02T03:49:53Z</dc:date>
    </item>
  </channel>
</rss>

