Skip to main content
Announcements
Qlik Connect 2025! Join us in Orlando join us for 3 days of immersive learning: REGISTER TODAY

Talend Studio: OutOfMemory Exception

No ratings
cancel
Showing results for 
Search instead for 
Did you mean: 
TalendSolutionExpert
Contributor II

Talend Studio: OutOfMemory Exception

Last Update:

Jan 22, 2024 9:35:30 PM

Updated By:

Jamie_Gregory

Created date:

Apr 1, 2021 6:24:10 AM

During a Talend Job execution, an OutOfMemory Java exception may occur. This exception means that the Job ran out of memory. This article explains the common reasons for memory overflow that leads to the OutOfMemory Java exception, and offers some solutions and best practice.

There are several reasons for an OutOfMemory Java exception to occur. Most common reasons for it include:

  • Running a Job that contains a number of buffer components, such as tSortRow, tFilterRow, tMap, tAggregateRow, and tHashOutput.
  • Running a Job that processes a very large amount of data.

The error displays on the console as shown below:

Exception in thread "Thread-0" java.lang.OutOfMemoryError: Java heap space
    at java.util.LinkedList.listIterator(Unknown Source)
    at java.util.AbstractList.listIterator(Unknown Source)
    at java.util.AbstractSequentialList.iterator(Unknown Source)
    at routines.system.RunStat.sendMessages(RunStat.java:261)
    at routines.system.RunStat.run(RunStat.java:225)
    at java.lang.Thread.run(Unknown Source)

Resolution

This issue can usually be solved by optimizing the job as described in the following section. However, if the Job cannot be optimized or if the problem still exists despite the Job having been optimized, then you need to allocate more memory for the Job to be able to process large amounts of data. This article also provides some advice on how to allocate more memory. However note that the default memory setting should be sufficient for all normal Job executions.

Job design optimization

There are different ways of optimizing your Job Designs in order to improve the performance. A number of components include some native parameters to help you optimize your Jobs.

In Jobs that contain buffer components such as tSortRow as well as tMap, you can change the basic configuration to store temporary data on disk rather than in memory. For example, on tMap, click the tMap settings panel of Lookup flow and select the Store temp data option as true.

0693p000008uB58AAE.png

Then, set a directory path for temp data in the Basic settings panel.

0693p000008uAoyAAE.png

You can also optimize your Jobs that use a tMysqlInput component. In this case, select the option Enable stream in the Advanced settings panel to replace the usual buffering processing with a stream processing that allows the code to read from a large table without consuming a large amount of memory. This will optimize the performance.

Allocating more memory to one or several Jobs

If you cannot optimize the Job design, you can at least allocate more memory to the Job. The following sections describe how to allocate more memory to one Job via the Studio, to all Jobs via the Studio, or to a Job script outside the Studio.

One Job in Talend Studio

  1. Open the Job to which you want to allocate more memory.
  2. In the Run view, open the Advanced Settings tab and select the Use specific JVM arguments box.
  3. Allocate more memory to the active Job by double-clicking the default JVM arguments and editing them:

    0693p000008uB5DAAU.png

Note: This change only applies for the active Job. The JVM settings will persist in the job script, and take effect when the job is exported and is executed outside Talend Studio.

All Jobs in Talend Studio

  1. In Talend Studio, click the Window menu, then select Preferences.
  2. Under Preferences, expand the Talend node, and selectRun/Debug.
  3. Allocate more memory by editing the JVM parameters in the Job Run VM arguments table:

    0693p000008uAoZAAU.png

Note: This change is effective for all Jobs.

The default setting should be sufficient for all normal Job executions, therefore this global change of JVM arguments is not recommended.

Job scripts outside the Studio

If you have exported the Job as a Job script, or you only have access to the Job script, you can allocate more memory by modifying the script file. To do so:

  1. Open the script file: yourJobName_run.bat or yourJobName_run.sh.
  2. Modify the JVM arguments (-Xms and -Xmx) as follows:

    %~d0
     cd %~dp0
    java -Xms256M -Xmx2048M -cp classpath.jar; shong.test_0_1.test --context=Default %*
Labels (2)