[resolved] How to improve performance for the job?
I have a job to load data from flat file to oracle DB. The flat file is processed many times, once per each dimension (to insert new dimensions), once to load fact. Each dimension process uses tMap to get unique new dimension and tOracleOutput to insert records. Last fact process also uses tMap to get dimension Ids and then tOracleOutput to insert new ones.
The performance is ok when there is no/little data in DB. But it downgrades sharply as more data in DB. The flat file is from 50M to 1G, with 50K records at least. Any suggestions on improving the performance?
What's the best practice for this kind of data loading to keep a good performance?
Hi,
Have you tried to check out "Use Batch " option in toracleOutput component to activate the batch mode for data processing?
Would you mind posting your job design screenshots into forum?
Best regards
Sabrina