<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic OutofMemory Exception - Heap space + GC overload in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348667#M115759</link>
    <description>&lt;P&gt;Hello all,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I am currently working with a large data set and trying to produce an XML file as the final output.&lt;BR /&gt;The tests were successful with sample data; however, with the full data set I am encountering the exceptions below:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;EM&gt;&lt;FONT color="#FF0000"&gt;- java.lang.OutOfMemoryError: Java heap space&lt;/FONT&gt;&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;&lt;FONT color="#FF0000"&gt;- java.lang.OutOfMemoryError: GC overhead limit exceeded&lt;/FONT&gt;&lt;/EM&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;My job reads 3 CSV files:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;standard : 88,151 rows (main)&lt;BR /&gt;personal : 5,900,000 rows (lookup)&lt;BR /&gt;address : 230,000 rows (lookup)&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Each standard row is linked to approximately 75 personal rows and 15 address rows.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;First, I tried using a tHashOutput to keep the data in memory, to see how the job processes it.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="option1.PNG" style="width: 786px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lr63.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/146749iE756CC07D94E2AEF/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lr63.png" alt="0683p000009Lr63.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Secondly, I also tried writing the lookup data to a delimited (CSV) file rather than keeping it in memory:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="option2.PNG" style="width: 799px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lr9B.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/148589i6F356C33DCF01FE2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lr9B.png" alt="0683p000009Lr9B.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Please note that I cannot use temp directory storage for the lookup because I am using a tXMLMap, where this option is not available.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;After some investigation, I also tried increasing the JVM heap arguments:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;PC RAM: 8 GB&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;-Xmx4096M&lt;/P&gt; 
&lt;P&gt;-Xms2048M&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;As a result, only 9,500 of the&amp;nbsp;&lt;SPAN&gt;88,151 rows were processed before the job ended with the out-of-memory exception mentioned above.&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Can you advise or propose a solution? Thank you.&lt;/P&gt;</description>
    <pubDate>Sat, 16 Nov 2024 09:16:34 GMT</pubDate>
    <dc:creator>RA6</dc:creator>
    <dc:date>2024-11-16T09:16:34Z</dc:date>
    <item>
      <title>OutofMemory Exception - Heap space + GC overload</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348667#M115759</link>
      <description>&lt;P&gt;Hello all,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;I am currently working with a large data set and trying to produce an XML file as the final output.&lt;BR /&gt;The tests were successful with sample data; however, with the full data set I am encountering the exceptions below:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;EM&gt;&lt;FONT color="#FF0000"&gt;- java.lang.OutOfMemoryError: Java heap space&lt;/FONT&gt;&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;&lt;FONT color="#FF0000"&gt;- java.lang.OutOfMemoryError: GC overhead limit exceeded&lt;/FONT&gt;&lt;/EM&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;My job reads 3 CSV files:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;standard : 88,151 rows (main)&lt;BR /&gt;personal : 5,900,000 rows (lookup)&lt;BR /&gt;address : 230,000 rows (lookup)&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Each standard row is linked to approximately 75 personal rows and 15 address rows.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;First, I tried using a tHashOutput to keep the data in memory, to see how the job processes it.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="option1.PNG" style="width: 786px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lr63.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/146749iE756CC07D94E2AEF/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lr63.png" alt="0683p000009Lr63.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Secondly, I also tried writing the lookup data to a delimited (CSV) file rather than keeping it in memory:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="option2.PNG" style="width: 799px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lr9B.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/148589i6F356C33DCF01FE2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lr9B.png" alt="0683p000009Lr9B.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Please note that I cannot use temp directory storage for the lookup because I am using a tXMLMap, where this option is not available.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;After some investigation, I also tried increasing the JVM heap arguments:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;PC RAM: 8 GB&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;-Xmx4096M&lt;/P&gt; 
&lt;P&gt;-Xms2048M&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;As a result, only 9,500 of the&amp;nbsp;&lt;SPAN&gt;88,151 rows were processed before the job ended with the out-of-memory exception mentioned above.&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Can you advise or propose a solution? Thank you.&lt;/P&gt;</description>
      <pubDate>Sat, 16 Nov 2024 09:16:34 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348667#M115759</guid>
      <dc:creator>RA6</dc:creator>
      <dc:date>2024-11-16T09:16:34Z</dc:date>
    </item>
    <item>
      <title>Re: OutofMemory Exception - Heap space + GC overload</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348668#M115760</link>
      <description>&lt;P&gt;This isn't guaranteed to work, but it might help. You say that you cannot use temp directory storage because you are using a tXMLMap. Could you try joining your data in a tMap (using the temp directory storage), releasing the memory used by the tHash components (by ticking "Clear cache after reading"), filtering your joined data set down to just the essential data, and then outputting that to a new tHash? Then, in another subjob, build the XML with the tXMLMap, reading from the tHash.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;EDIT: One more thing I just remembered (maybe try this first, before the other changes): set the "custom the flush buffer size" setting to something like 1000 rows (and experiment). Otherwise the whole data set will end up in memory before it is written to the file.&lt;/P&gt;</description>
      <pubDate>Tue, 19 Sep 2017 09:57:38 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348668#M115760</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-09-19T09:57:38Z</dc:date>
    </item>
    <item>
      <title>Re: OutofMemory Exception - Heap space + GC overload</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348669#M115761</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Thank you for your reply.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Can you please confirm whether working directly with CSV files is faster than using memory storage (tBuffer, tHash)?&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;If so, I am trying to do the lookup using CSV files, storing them in the temp directory just before the tXMLMap.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Hope it works.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="df.PNG" style="width: 454px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lqx2.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/154132i73A746C41773A9E1/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lqx2.png" alt="0683p000009Lqx2.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 20 Sep 2017 09:11:36 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348669#M115761</guid>
      <dc:creator>RA6</dc:creator>
      <dc:date>2017-09-20T09:11:36Z</dc:date>
    </item>
    <item>
      <title>Re: OutofMemory Exception - Heap space + GC overload</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348670#M115762</link>
      <description>&lt;P&gt;I now get the following error while trying to convert the XML document to a string:&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capturedf.PNG" style="width: 775px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lqs3.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/156630i32AC02F3C92141C2/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lqs3.png" alt="0683p000009Lqs3.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;SPAN class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="sdfs.PNG" style="width: 945px;"&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009Lqwh.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/135339i89DBC036644343FF/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009Lqwh.png" alt="0683p000009Lqwh.png" /&gt;&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Can you advise or propose a solution, please?&lt;/P&gt;</description>
      <pubDate>Wed, 20 Sep 2017 09:31:18 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348670#M115762</guid>
      <dc:creator>RA6</dc:creator>
      <dc:date>2017-09-20T09:31:18Z</dc:date>
    </item>
    <item>
      <title>Re: OutofMemory Exception - Heap space + GC overload</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348671#M115763</link>
      <description>Why are you converting the document to a String? Is this entirely necessary? This is going to be an expensive process (memory-wise), and it would be better to avoid it if you can.
&lt;BR /&gt;
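To illustrate why skipping the String conversion helps, here is a minimal sketch in plain Java, outside Talend, that writes XML element by element with the JDK's StAX XMLStreamWriter, so neither a full Document nor a full String has to sit in the heap. The class name StreamingXmlSketch, the method writeRows, and the element and attribute names are all hypothetical, and this is not the code that tXMLMap or any other Talend component generates; it only shows the general streaming idea.

```java
import java.io.StringWriter;
import java.io.Writer;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamWriter;

// Hypothetical illustration only: names below are not taken from the job.
class StreamingXmlSketch {

    // Writes `count` placeholder "standard" elements straight to the target,
    // one at a time, instead of building the whole document in memory first.
    static void writeRows(Writer target, int count) throws XMLStreamException {
        XMLStreamWriter xml = XMLOutputFactory.newInstance().createXMLStreamWriter(target);
        xml.writeStartDocument();
        xml.writeStartElement("standards");
        for (int i = 1; count >= i; i++) {
            xml.writeStartElement("standard");
            xml.writeAttribute("id", String.valueOf(i));
            xml.writeEndElement();
            // Flush every 1000 rows so buffered output is pushed to the target.
            if (i % 1000 == 0) {
                xml.flush();
            }
        }
        xml.writeEndElement();
        xml.writeEndDocument();
        xml.close(); // closes the XML writer, not the underlying target
    }

    public static void main(String[] args) throws Exception {
        // A StringWriter keeps the demo self-contained; for large data the
        // target would be a file-backed writer so the output leaves the heap.
        StringWriter out = new StringWriter();
        writeRows(out, 3);
        System.out.println(out);
    }
}
```

In a real job the equivalent would be to send the rows to a file output rather than to a String column, assuming the downstream step can read from a file.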
&lt;BR /&gt;In terms of CSV vs memory, memory is quicker. But you are struggling with memory at the moment, so solve that problem first then look at making it faster.</description>
      <pubDate>Wed, 20 Sep 2017 11:55:05 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OutofMemory-Exception-Heap-space-GC-overlaod/m-p/2348671#M115763</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2017-09-20T11:55:05Z</dc:date>
    </item>
  </channel>
</rss>

