Ritz05
Contributor

Unable to recognize OLE stream for .xlsx in tFileInputExcel

Hi Team,

I need to read a particular sheet/tab from an Excel file with the .xlsx extension in a Talend job, using the tFileInputExcel component.

But I am getting the error "Unable to recognize OLE stream". When I changed the extension to .xls, it worked. However, I need to avoid changing the extension and read the files as they are, per the requirement.

Should I use a different component to read a particular sheet/tab from this type of Excel file? Please help me resolve this issue.

Thanks,

Rita

6 Replies
Anonymous
Not applicable

Hi

You have to check the 'Read excel2007 file format (xlsx)' box if you are reading a .xlsx file.

 

Regards

Shong

jlolling
Creator III

If you have to handle files of different types, you can use the custom components tFileExcelWorkbookOpen + tFileExcelSheetInput. These components detect the file type automatically and always handle both formats in the same way.

Ritz05
Contributor
Author

Hi Shong,

I tried that by checking the 'Read excel2007 file format (xlsx)' box, but in that case the job throws this error:

 

Cleaning up unclosed ZipFile for archive /var/opt/talend/SRC_CTMS/Output/EDMTestDev/Documents/MASTER_SUBJECT_VISIT_DELETES_V1.xlsx

Exception in thread "Thread-2" java.lang.OutOfMemoryError: GC overhead limit exceeded

 

Thanks,

Rita

jlolling
Creator III

The new Open XML based Excel format is very memory-intensive to read. You can increase the memory available to the job by adding or editing the JVM parameter -Xmx, for example -Xmx2G.
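To confirm that the job's JVM actually picked up the new heap limit, a small check like the following can help. This is only a minimal sketch using the standard `Runtime` API. The class name `HeapCheck` and the helper `toMegabytes` are made up for illustration and are not part of Talend.

```java
class HeapCheck {
    // Convert a byte count to whole mebibytes (integer division, rounds down).
    static long toMegabytes(long bytes) {
        return bytes / (1024L * 1024L);
    }

    public static void main(String[] args) {
        Runtime rt = Runtime.getRuntime();
        // maxMemory() is the upper bound the heap may grow to (driven by -Xmx);
        // the reported value can be slightly below the configured figure.
        System.out.println("Max heap (MB):       " + toMegabytes(rt.maxMemory()));
        // totalMemory() is what the JVM has reserved so far (starts near -Xms).
        System.out.println("Allocated heap (MB): " + toMegabytes(rt.totalMemory()));
        // freeMemory() is the unused part of the currently allocated heap.
        System.out.println("Free in heap (MB):   " + toMegabytes(rt.freeMemory()));
    }
}
```

Running this with the same JVM arguments as the job (for example `-Xms256M -Xmx2G`) should print a maximum close to 2048 MB if the setting is in effect.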

jlolling
Creator III

By "changing this extension", do you mean that you opened the file in Excel and saved it back in the old OLE format?

Simply renaming the file will not help, and it is potentially dangerous.
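The reason renaming does not help is that the two formats differ in their content, not just in their extension: an old-style .xls file is an OLE2 compound document, while a .xlsx file is a ZIP container. A rough way to see which one a file really is, regardless of its name, is to look at its first bytes. The sketch below uses only the JDK; the class name `ExcelFormatSniffer` and its methods are hypothetical, and a ZIP signature only shows that the file is a ZIP archive, not that it is a valid workbook.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

class ExcelFormatSniffer {
    // OLE2 compound document signature (container used by legacy .xls files).
    static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };
    // ZIP local file header signature "PK\003\004" (.xlsx is a ZIP container).
    static final byte[] ZIP_MAGIC = { 'P', 'K', 0x03, 0x04 };

    // True if data begins with the given prefix.
    static boolean startsWith(byte[] data, byte[] prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        return Arrays.equals(Arrays.copyOf(data, prefix.length), prefix);
    }

    // Classify a file by its leading bytes, ignoring the file name.
    static String detect(byte[] header) {
        if (startsWith(header, OLE2_MAGIC)) {
            return "OLE2 (.xls)";
        }
        if (startsWith(header, ZIP_MAGIC)) {
            return "ZIP/OOXML (.xlsx)";
        }
        return "unknown";
    }

    // Read up to 8 bytes from the file and classify them.
    static String detect(Path file) throws IOException {
        byte[] header = new byte[8];
        int filled = 0;
        try (InputStream in = Files.newInputStream(file)) {
            while (filled < header.length) {
                int n = in.read(header, filled, header.length - filled);
                if (n < 0) {
                    break; // file shorter than 8 bytes
                }
                filled += n;
            }
        }
        return detect(Arrays.copyOf(header, filled));
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 1) {
            System.out.println("usage: java ExcelFormatSniffer <file>");
            return;
        }
        Path file = Paths.get(args[0]);
        System.out.println(file + " -> " + detect(file));
    }
}
```

If a file named .xlsx is reported as OLE2 here (or the other way round), the extension does not match the content, and the reader setting should follow the content rather than the name.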

Ritz05
Contributor
Author

Hi Team,

 

With the 'Read excel2007 file format (xlsx)' box checked and the JVM parameter increased to -Xmx2G, the job now accepts the file data, but loading the data into the Snowflake table is very slow. Do I need to increase the -Xms JVM parameter as well? Right now it is -Xms256M.

Or what else can be done to improve the job's performance when loading the data?