<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: processing large JSON file error: OutOfMemory in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293996#M66915</link>
    <description>Hi, 
&lt;BR /&gt;For csv file, the answer is yes, use tFileInputFullRow to read the source file row by row and generate several files with the option 'Split the files into several files' on tFileOutputDelimited. 
&lt;BR /&gt;The job looks like: 
&lt;BR /&gt;tFileInputFullRow--main--tFileOutputDelimited 
&lt;BR /&gt;See pics for details 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MDWh.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/152570i0C452A1DBE86FC5E/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MDWh.png" alt="0683p000009MDWh.png" /&gt;&lt;/span&gt;</description>
    <pubDate>Mon, 28 Jan 2013 06:08:22 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2013-01-28T06:08:22Z</dc:date>
    <item>
      <title>processing large JSON file error: OutOfMemory</title>
      <link>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293993#M66912</link>
      <description>Hi, I am new to Talend Open Studio for Data Integration. I was able to manage creating few test jobs successfully. Now I am running into an issue: when the tInputFileJSON reads a big file (80M), it will run out of memory. I have made some JVM changes to increase heap size, but still running into the same problem. As a matter of fact, I am not surprise by that as I can only increase heap size so much, but potential input data is "unlimited". 
&lt;BR /&gt;Just wonder how I can control the job to read say 5000 rows and process the data before bringing next 5000 row? what is the best practice to process large file in Talend?</description>
      <pubDate>Mon, 28 Jan 2013 00:04:26 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293993#M66912</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-01-28T00:04:26Z</dc:date>
    </item>
    <item>
      <title>Re: processing large JSON file error: OutOfMemory</title>
      <link>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293994#M66913</link>
      <description>Hi, 
&lt;BR /&gt;In Talend, the input component such as tFileInputDelimited, tFileInputJson read all rows and cache them into memory, and then iterate one by one. So, it is impossible to process the first N rows, then next the N rows. 
&lt;BR /&gt;DO you have some other components like tMap, tFilterRow which consumes more memory in the job? For OutOfMemory exception, we are ususlly required to allocate more memory to the job,please see this KB article: 
&lt;BR /&gt; 
&lt;A href="https://community.qlik.com/s/article/ka03p0000006EZuAAM" target="_blank"&gt;outOfMemory&lt;/A&gt; 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina</description>
      <pubDate>Mon, 28 Jan 2013 02:19:14 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293994#M66913</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-01-28T02:19:14Z</dc:date>
    </item>
    <item>
      <title>Re: processing large JSON file error: OutOfMemory</title>
      <link>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293995#M66914</link>
      <description>Thank you Sabrina! My JSON is pretty complicated and file size will keep increasing. So I don't think keeping allocating more memory is good for me in long run. Is there a way I can split a big file into multiple smaller files by the value of primary key? For example, I have a CSV with (device_id, billing_date, billing_amount), can I save all rows for device_id 111222 to file 111222.csv, and all rows for device_id 111333 to 111333.csv?</description>
      <pubDate>Mon, 28 Jan 2013 04:58:54 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293995#M66914</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-01-28T04:58:54Z</dc:date>
    </item>
    <item>
      <title>Re: processing large JSON file error: OutOfMemory</title>
      <link>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293996#M66915</link>
      <description>Hi, 
&lt;BR /&gt;For csv file, the answer is yes, use tFileInputFullRow to read the source file row by row and generate several files with the option 'Split the files into several files' on tFileOutputDelimited. 
&lt;BR /&gt;The job looks like: 
&lt;BR /&gt;tFileInputFullRow--main--tFileOutputDelimited 
&lt;BR /&gt;See pics for details 
&lt;BR /&gt;Best regards 
&lt;BR /&gt;Sabrina 
&lt;BR /&gt; 
&lt;span class="lia-inline-image-display-wrapper" image-alt="0683p000009MDWh.png"&gt;&lt;img src="https://community.qlik.com/t5/image/serverpage/image-id/152570i0C452A1DBE86FC5E/image-size/large?v=v2&amp;amp;px=999" role="button" title="0683p000009MDWh.png" alt="0683p000009MDWh.png" /&gt;&lt;/span&gt;</description>
      <pubDate>Mon, 28 Jan 2013 06:08:22 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293996#M66915</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-01-28T06:08:22Z</dc:date>
    </item>
    <item>
      <title>Re: processing large JSON file error: OutOfMemory</title>
      <link>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293997#M66916</link>
      <description>&lt;P&gt;Has anyone solved this , I have the same issue .&lt;/P&gt; 
&lt;P&gt;JSON file is&amp;nbsp; &amp;gt; 200mb and talend goes out of memory while parsing it&lt;/P&gt;</description>
      <pubDate>Sat, 23 Feb 2019 13:34:36 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/processing-large-JSON-file-error-OutOfMemory/m-p/2293997#M66916</guid>
      <dc:creator>karandama2006</dc:creator>
      <dc:date>2019-02-23T13:34:36Z</dc:date>
    </item>
  </channel>
</rss>

