Hello Talend Community,
I’m experiencing a data loss issue in an incremental load job in Talend and would appreciate your insights to identify the root cause and recommend solutions. Here’s a detailed overview of my setup, the problem, and the possibilities I’ve considered:
I have a Talend job designed to handle three flows: full load, incremental load, and custom load, triggered by user-defined start and end dates. The focus of my issue is the incremental load, which processes data from an Oracle SaaS source (via SOAP API calls) and writes to an Oracle ATP target. The job flow is as follows:
DELETE FROM "+context.table_name+" t
WHERE NOT EXISTS (
    SELECT 1
    FROM "+context.table_name+"_L l
    WHERE t.FULFILL_LINE_ID = l.FULFILL_LINE_ID
)

AutoCommit is enabled for all database operations, so each statement commits immediately.
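For anyone trying to reproduce this, here is a minimal sketch of how a NOT EXISTS delete of this shape behaves. It uses an in-memory SQLite database as a stand-in for Oracle ATP, and the table names ORDERS and ORDERS_L are hypothetical substitutes for context.table_name and its _L table. The assumption being tested is that the _L table holds only the keys from the current incremental window; whether that matches the actual job depends on how the temp table is populated.

```python
import sqlite3

# In-memory SQLite stands in for Oracle ATP (assumption for illustration only).
# ORDERS / ORDERS_L are hypothetical names for context.table_name and its _L table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE ORDERS (FULFILL_LINE_ID INTEGER PRIMARY KEY, STATUS TEXT)")
conn.execute("CREATE TABLE ORDERS_L (FULFILL_LINE_ID INTEGER PRIMARY KEY)")

# The target already holds three rows from earlier loads.
conn.executemany(
    "INSERT INTO ORDERS VALUES (?, ?)",
    [(1, "CLOSED"), (2, "OPEN"), (3, "OPEN")],
)

# Assume the incremental window returned only line 3.
conn.execute("INSERT INTO ORDERS_L VALUES (3)")

# Same shape as the DELETE in the job: remove target rows with no match in _L.
deleted = conn.execute(
    """
    DELETE FROM ORDERS
    WHERE NOT EXISTS (
        SELECT 1
        FROM ORDERS_L l
        WHERE ORDERS.FULFILL_LINE_ID = l.FULFILL_LINE_ID
    )
    """
).rowcount

remaining = [row[0] for row in conn.execute(
    "SELECT FULFILL_LINE_ID FROM ORDERS ORDER BY FULFILL_LINE_ID"
)]
print(deleted, remaining)
```

Under that assumption, lines 1 and 2 are removed from the target even though nothing deleted them at the source. If the _L table is instead rebuilt with the full set of source keys on every run, the same statement only removes rows that are really gone from the source.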
Problem:
I’m experiencing data loss in the incremental load, with the following specific observations:
The job fails if the insert/update flow errors, stopping subsequent steps (temp table write and delete), but the data loss persists even when the job completes successfully.
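As a side note on the failure case, one option is to run the insert/update and the later steps as a single transaction, so that a failed run leaves nothing half-committed. The sketch below shows the idea with SQLite in place of Oracle ATP. The table names, the run_incremental helper, and the fail_after_upsert flag are all hypothetical and exist only for this illustration. It does not address losses that occur on successful runs.

```python
import sqlite3

# isolation_level=None puts the connection in autocommit mode, so the
# transaction boundaries below are controlled explicitly with BEGIN/COMMIT.
conn = sqlite3.connect(":memory:", isolation_level=None)
conn.execute("CREATE TABLE ORDERS (FULFILL_LINE_ID INTEGER PRIMARY KEY, STATUS TEXT)")
conn.execute("CREATE TABLE ORDERS_L (FULFILL_LINE_ID INTEGER PRIMARY KEY)")
conn.execute("INSERT INTO ORDERS VALUES (1, 'OPEN')")


def run_incremental(conn, rows, fail_after_upsert=False):
    """Hypothetical helper: upsert and staging write as one unit of work."""
    conn.execute("BEGIN")
    try:
        # Step 1: insert/update into the target table.
        conn.executemany("INSERT OR REPLACE INTO ORDERS VALUES (?, ?)", rows)
        if fail_after_upsert:
            raise RuntimeError("simulated failure between steps")
        # Step 2: refresh the staging (_L) table with the keys just loaded.
        conn.execute("DELETE FROM ORDERS_L")
        conn.executemany("INSERT INTO ORDERS_L VALUES (?)", [(r[0],) for r in rows])
        conn.execute("COMMIT")
        return True
    except Exception:
        # Any error undoes every step of this run.
        conn.execute("ROLLBACK")
        return False


batch = [(1, "CLOSED"), (2, "OPEN")]
failed_ok = run_incremental(conn, batch, fail_after_upsert=True)
status_after_failure = conn.execute(
    "SELECT STATUS FROM ORDERS WHERE FULFILL_LINE_ID = 1"
).fetchone()[0]
retry_ok = run_incremental(conn, batch)
```

After the simulated failure the target row still has its original status, and the retry applies the whole batch. In Talend the equivalent would be disabling AutoCommit and committing once at the end of the subjob, which is a design choice to weigh against batch size and lock duration.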
Job Design:
Thanks,
Mohan
Hello Mohan,
The data loss observed during incremental load from Oracle ATP using CDC is typically related to Oracle LogMiner behavior and redo log availability, rather than a Talend defect.
CDC relies on Oracle redo/archived logs. If the required logs are purged, overwritten, or unavailable during the capture window, LogMiner cannot read all changes, which can result in missing records in the CDC output.
Recommendations:
Ensure sufficient redo or archive log retention so all required logs remain available for the CDC window.
Run CDC more frequently to reduce the risk of logs aging out.
Verify supplemental logging is enabled on all source tables.
Monitor LogMiner access and dictionary availability during CDC runs.
On Oracle ATP, log retention policies are managed by the service and may shorten the available CDC window. This behavior is expected and should be accounted for in the CDC design.
Thanks,
Gourav