<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic OutOfMemoryError:GC overhead limit exceeded in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/OutOfMemoryError-GC-overhead-limit-exceeded/m-p/2284133#M57891</link>
    <description>Hi, 
&lt;BR /&gt;Lets say i have two hdfs files A and B. File A with 40 columns. I am generating 10 million data for file A. While generating data for file B, i need to take random records from file A (like 7 or 8 columns only primary key - foreign key relationship). As i am giving 10 million data as lookup, i am getting GC overhead issue. I have 8GB RAM and i have given XMX till 8096. I tried with 4096 as well. 
&lt;BR /&gt; 
&lt;BR /&gt;Please give a solution to solve this issue / an alternate method to take random records. 
&lt;BR /&gt; 
&lt;BR /&gt;As i am generating data dynamically, i think breaking the files in to small files and fetching random records is not possible. Please clarify. 
&lt;BR /&gt; 
&lt;BR /&gt;I have used a temporary location also for tmap and increased buffer size too. But i couldn't find whether its working. Please let me know if there is a way to check whether temp data is getting stored in this location and there is a effect in increasing buffer size.</description>
    <pubDate>Sat, 16 Nov 2024 04:26:45 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2024-11-16T04:26:45Z</dc:date>
    <item>
      <title>OutOfMemoryError:GC overhead limit exceeded</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OutOfMemoryError-GC-overhead-limit-exceeded/m-p/2284133#M57891</link>
      <description>Hi, 
&lt;BR /&gt;Lets say i have two hdfs files A and B. File A with 40 columns. I am generating 10 million data for file A. While generating data for file B, i need to take random records from file A (like 7 or 8 columns only primary key - foreign key relationship). As i am giving 10 million data as lookup, i am getting GC overhead issue. I have 8GB RAM and i have given XMX till 8096. I tried with 4096 as well. 
&lt;BR /&gt; 
&lt;BR /&gt;Please give a solution to solve this issue / an alternate method to take random records. 
&lt;BR /&gt; 
&lt;BR /&gt;As i am generating data dynamically, i think breaking the files in to small files and fetching random records is not possible. Please clarify. 
&lt;BR /&gt; 
&lt;BR /&gt;I have used a temporary location also for tmap and increased buffer size too. But i couldn't find whether its working. Please let me know if there is a way to check whether temp data is getting stored in this location and there is a effect in increasing buffer size.</description>
      <pubDate>Sat, 16 Nov 2024 04:26:45 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OutOfMemoryError-GC-overhead-limit-exceeded/m-p/2284133#M57891</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-11-16T04:26:45Z</dc:date>
    </item>
    <item>
      <title>Re: OutOfMemoryError:GC overhead limit exceeded</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OutOfMemoryError-GC-overhead-limit-exceeded/m-p/2284134#M57892</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp; &amp;nbsp; When the lookup size is more, you will have to use Store temp data in disk option in tMap. The details can be referred from below link.&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;&lt;A href="https://help.talend.com/reader/EJfmjmfWqXUp5sadUwoGBA/J4xg5kxhK1afr7i7rFA65w" target="_blank" rel="nofollow noopener noreferrer"&gt;https://help.talend.com/reader/EJfmjmfWqXUp5sadUwoGBA/J4xg5kxhK1afr7i7rFA65w&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;&amp;nbsp;&lt;/P&gt; 
&lt;P&gt;Warm Regards,&lt;BR /&gt;Nikhil Thampi&lt;/P&gt; 
&lt;P&gt;Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved&lt;/P&gt;</description>
      <pubDate>Thu, 10 Oct 2019 23:11:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OutOfMemoryError-GC-overhead-limit-exceeded/m-p/2284134#M57892</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-10-10T23:11:14Z</dc:date>
    </item>
    <item>
      <title>Re: OutOfMemoryError:GC overhead limit exceeded</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OutOfMemoryError-GC-overhead-limit-exceeded/m-p/2284135#M57893</link>
      <description>Hi Nikhil, 
&lt;BR /&gt; 
&lt;BR /&gt;Thank you for your suggestion. But as i have said already i used temporary location already for storing temp data. Still I get memory issue. XMX size of 8096 is not enough for this lookup but i cannot give higher xmx either. 
&lt;BR /&gt; 
&lt;BR /&gt;When i store temp data on disk, i am getting error message in my desktop too (Saying something like- " an application is using lot of space,close the program?")</description>
      <pubDate>Fri, 11 Oct 2019 04:08:03 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OutOfMemoryError-GC-overhead-limit-exceeded/m-p/2284135#M57893</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2019-10-11T04:08:03Z</dc:date>
    </item>
    <item>
      <title>Re: OutOfMemoryError:GC overhead limit exceeded</title>
      <link>https://community.qlik.com/t5/Talend-Studio/OutOfMemoryError-GC-overhead-limit-exceeded/m-p/2284136#M57894</link>
      <description>&lt;P&gt;&lt;A href="https://community.qlik.com/s/profile/0053p000007LQQEAA4"&gt;@lbhavya345&lt;/A&gt;&amp;nbsp;,you need to split the data and do the required operations otherwise need to increase the RAM size and do the process&lt;/P&gt;</description>
      <pubDate>Fri, 11 Oct 2019 11:41:43 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/OutOfMemoryError-GC-overhead-limit-exceeded/m-p/2284136#M57894</guid>
      <dc:creator>manodwhb</dc:creator>
      <dc:date>2019-10-11T11:41:43Z</dc:date>
    </item>
  </channel>
</rss>

