jerome_dal
Contributor II

Optimizing tSqoopImport on Oracle database

Hello,

I use the tSqoopImport component (Talend v7.3.1) to load Oracle data into a Parquet file on a Cloudera Hadoop platform (v7.1.6).

I have tried changing several parameters to speed up the data load, without success.

The component exposes several parameters in its standard properties, but I would like to set others described in the Sqoop documentation, for instance "-Doracle.row.fetch.size=50000" or "--fetch-size=20000". I have not found the right syntax for that.

Any idea how to configure these parameters correctly?
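
To make the question concrete, here is a minimal sketch of the plain `sqoop import` command line I believe these settings would map to outside Talend. It only assembles and prints the arguments; it does not run Sqoop. The connection string, table name, target directory and the helper name `build_sqoop_import_command` are hypothetical placeholders. The one rule it encodes is that generic Hadoop options (`-Dname=value`) have to come right after the tool name, before tool-specific options such as `--connect`. Whether `--direct` and the two fetch-size settings actually combine well with Parquet output on this platform is an assumption, not something the sketch establishes.

```python
import shlex


def build_sqoop_import_command(connect, table, target_dir,
                               generic_props=None, fetch_size=None, direct=False):
    """Assemble the argument list for a `sqoop import` call (hypothetical helper)."""
    argv = ["sqoop", "import"]

    # Generic Hadoop options (-Dname=value) go directly after the tool name,
    # before any tool-specific option.
    for name, value in (generic_props or {}).items():
        argv.append(f"-D{name}={value}")

    # Tool-specific import options.
    argv += ["--connect", connect,
             "--table", table,
             "--target-dir", target_dir,
             "--as-parquetfile"]
    if fetch_size is not None:
        argv += ["--fetch-size", str(fetch_size)]
    if direct:
        argv.append("--direct")
    return argv


command = build_sqoop_import_command(
    connect="jdbc:oracle:thin:@//dbhost:1521/ORCLPDB",  # placeholder connection string
    table="MY_SCHEMA.MY_TABLE",                         # placeholder table
    target_dir="/data/my_table",                        # placeholder HDFS directory
    generic_props={"oracle.row.fetch.size": 50000},
    fetch_size=20000,
    direct=True,
)

# Print the command as a single shell-quoted line for comparison with the job log.
print(shlex.join(command))
```

If the command that the Talend job logs at runtime could be compared against a line like this, it would show whether the extra properties and "--direct" are really being passed through.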

 

As for the standard "Direct" check box, I assume it corresponds to the "--direct" parameter. According to examples found online, it should improve performance when loading Oracle data, but it made no difference on my platform. How can I verify that this parameter is actually taken into account?

Do I need to use another JDBC library? I currently use "mvn:org.talend.libraries/ojdbc7/6.0.0/jar".
