<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Operation with Large Excel file in Talend Studio</title>
    <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295857#M68581</link>
    <description>adding to above post.
&lt;BR /&gt;I also enabled the memory saving mode while using tFileExcelSheetInput and tFileExcelWorkbookOpen to open the excel files.
&lt;BR /&gt;Please help in advising on above post.</description>
    <pubDate>Thu, 25 Apr 2013 14:32:47 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2013-04-25T14:32:47Z</dc:date>
    <item>
      <title>Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295850#M68574</link>
      <description>Hello,
&lt;BR /&gt;I have an excel spreadsheet which has 5 Lac records and of size 37 MB ( It has roughly 78 columns). I am unable to create metadata for that file. Talend is throwing "Java heap space exception"
&lt;BR /&gt;I need to process that file in couple of jobs. Please help.</description>
      <pubDate>Sun, 21 Apr 2013 17:47:37 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295850#M68574</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-21T17:47:37Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295851#M68575</link>
      <description>Create a smaller file to read the metadata and use the original file for your import. 
&lt;BR /&gt;In case of you experiences java heap space errors while processing your file in the job, try alternative components to read Excel files (tFileExcelWorkbookOpen and tFileExcelSheetInput). The component tFileExcelWorkbookOpen has an memory saving mode (works only for the newer XLSX format). tFileExcelSheetInput needs tFileExcelWorkbookOpen.</description>
      <pubDate>Sun, 21 Apr 2013 18:04:10 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295851#M68575</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-21T18:04:10Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295852#M68576</link>
      <description>Yes, as mentioned by jlolling, Use a smaller file for creating metadata. If you do not have smaller file then you can probably open the XML file through some editor like Notepad++ or Edit plus and select only few records to create an example or sample file. 
&lt;BR /&gt;One you have created metadata then I do not think that you should be having any issues with reading the large XML file.</description>
      <pubDate>Tue, 23 Apr 2013 07:38:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295852#M68576</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-23T07:38:29Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295853#M68577</link>
      <description>Hi Jilolling and Vikram, 
&lt;BR /&gt;Yes am able to create metadata with a smaller file. Now the problem is with operations with the large excel files. 
&lt;BR /&gt;As suggested by you, I have used tFileExcelSheetInput and tFileExcelWorkbookOpen to open the excel files and able to read only two files. When the result of those two files is tried to join with third file, again am facing "java heap space exception".Please help.</description>
      <pubDate>Tue, 23 Apr 2013 13:56:29 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295853#M68577</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-23T13:56:29Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295854#M68578</link>
      <description>Can you post the screen shot of tMap where you are joining the two file?
&lt;BR /&gt;Also, if you can let us know the number of records from both the files.</description>
      <pubDate>Wed, 24 Apr 2013 08:42:59 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295854#M68578</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-24T08:42:59Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295855#M68579</link>
      <description>Did you switched on the memory saving mode or do you have to deal with the old OLE based format?
&lt;BR /&gt;If you have very large xls files, yes there is no way without adding more memory or you have to splitt the input file.
&lt;BR /&gt;You could also decide to load the data files without look ups into staging tables and do the job in the database.</description>
      <pubDate>Wed, 24 Apr 2013 21:55:32 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295855#M68579</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-24T21:55:32Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295856#M68580</link>
      <description>Hi,&lt;BR /&gt;Am using tjoin to join the xlsx file. Actually we are migrating old legacy data ( data in spread sheets) into our new oracle 10g database. Here I need to join the spreadsheets (xlsx files) and load the necessary columns into a single staging table and from there actual business logic is to be applied.&lt;BR /&gt;My problem is that I am uable to join those xlsx files only. Am getting "java heap space exception" while reading/joining a few of them. Can u please suggest me a way to getting the source data (data in xlsx) files into the staging table.</description>
      <pubDate>Thu, 25 Apr 2013 13:27:53 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295856#M68580</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-25T13:27:53Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295857#M68581</link>
      <description>adding to above post.
&lt;BR /&gt;I also enabled the memory saving mode while using tFileExcelSheetInput and tFileExcelWorkbookOpen to open the excel files.
&lt;BR /&gt;Please help in advising on above post.</description>
      <pubDate>Thu, 25 Apr 2013 14:32:47 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295857#M68581</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-25T14:32:47Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295858#M68582</link>
      <description>You can select the required fields from both the excel sheet load then to different csv files. As delimited files are lighter and easier to read.&lt;BR /&gt;Now read delimited files and join using tMap. Also do not forget to switch on store temporary data to disc setting in tMap.&lt;BR /&gt;Let me know if it helps.</description>
      <pubDate>Thu, 25 Apr 2013 18:11:20 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295858#M68582</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-25T18:11:20Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295859#M68583</link>
      <description>That is the part which I do not understand. To save the content from a large Excel file into a database you should not be a big deal!&lt;BR /&gt;I suggest loading all files - WITHOUT joining to anything else - at first in the database and THAN with these staging tables (one file == one staging table) doing your lookups in the database not between files.</description>
      <pubDate>Thu, 25 Apr 2013 20:39:49 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295859#M68583</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-25T20:39:49Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295860#M68584</link>
      <description>Hi,
&lt;BR /&gt;Your suggestion helped me.. Thanks a lot.
&lt;BR /&gt;I have loaded individual files on to individual tables first and later on joined them later on. Thanks once again.
&lt;BR /&gt;Just for information, file with large data got loaded only from UNIX box and only after increasing heap size in shell script file.</description>
      <pubDate>Mon, 29 Apr 2013 17:32:42 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295860#M68584</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-29T17:32:42Z</dc:date>
    </item>
    <item>
      <title>Re: Operation with Large Excel file</title>
      <link>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295861#M68585</link>
      <description>You can set any JVM parameters for the job in the Run view in the section JVM parameters. It is not necessary to edit the created shell scripts for this reason.</description>
      <pubDate>Mon, 29 Apr 2013 22:48:55 GMT</pubDate>
      <guid>https://community.qlik.com/t5/Talend-Studio/Operation-with-Large-Excel-file/m-p/2295861#M68585</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2013-04-29T22:48:55Z</dc:date>
    </item>
  </channel>
</rss>

