Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
Hi Team,
I need to read a particular sheet/tab in one excel file having extension .xlsx in talend job using component tfileInputExcel component.
But getting below error (
Unable to recognize OLE stream). when i tried to change thisextension
to .xls its working. But i need to avoid changing the extension and read the files as it is as per the requirement.Should I use any other component to read such type of excel for a particular sheet/tab.Please help to resolve this issue.
Thanks,
Rita
Hi
You have to check the 'Read excel 2017 file format(xlsx) box if you read a .xlsx file.
Regards
Shong
If you have the problem of handling files with different types, you can use the custom components tFileExcelWorkbookOpen + tFileExcelSheetInput. These components detects the file type automatically and handle them always in the same way.
Hi Shong,
I had tried that by checking box 'Read excel 2017 file format(xlsx). But job is throwing this error in that case.
Cleaning up unclosed ZipFile for archive /var/opt/talend/SRC_CTMS/Output/EDMTestDev/Documents/MASTER_SUBJECT_VISIT_DELETES_V1.xlsx
Exception in thread "Thread-2" java.lang.OutOfMemoryError: GC overhead limit exceeded
Thanks,
Rita
the new Open XML based Excel format is very memory consuming. You can increase the memory of the job by adding or editing a JVM parameter to -Xmx2G
With "changing this extension" you mean you read the file with Excel and write it with the old OLE format back, right?
Simply renaming the file will not help and it potentially dangerous.
Hi Team,
By checking box 'Read excel 2017 file format(xlsx) and increasing JVM parameter to -Xmx2G job is accepting file data but performance wise it is very slow in loading data in snowflake table.Do I need to increase JVM parameter -Xms as well right now it is -Xms256M.
Or what can be done to increase the performance of the job in case of loading data.