Unlock a world of possibilities! Login now and discover the exclusive benefits awaiting you.
Hi,
After Data warehouse completed successfully, I want to delete data from the staging table.
Now in my scenario, after 20 days the employee's name 'xxx' department changed to 'yyy'.Now when I run the staging job(to delete old records) it deletes data from Employee and Department. After that incremental load for staging is run, It fetches data from Department table as data is modified (based on the 'Modified Date' column) but for Employee table we didn't get any data as there is no modification( i.e; Employee table Modified date = NULL) because data was already deleted.
My view query is something like this :
Left Join tbl_Dept D on E.Empid= D.Empid
So for Dimension table, while fetching data from the staging view, I didn't get proper data as an employee whose department was changed that information is available in the Department table but Employee table has no data. When I apply join in view using EmployeeId to fetch data it gives NULL as Department has data but Employee has no data for that EmployeeId.
I don't have any clue how to solve this issue.
Thanks,
Dhara
Hello,
Could you please have a look at talend CDC feature to see if it is what you are looking for?
https://help.talend.com/reader/WN4QKUFj9JViEWZYDWqNIA/N89jQafnjg2FeGfBtJ6Jfg
Best regards
Sabrina
Hi,
I am using Talend Open Studio, CDC feature is available in Talend Enterprise version I suppose.
Thanks,
Dhara
Hello,
The CDC feature is available in talend subscription solution not open source.
With talend open studio, If you want to capture the changed data and only load these changed data into target table to achieve table sync, you can compare tables by using tMap.
The work flow should be:
Target DB-->tMap(make inner join on your input and set the "Catch lookup inner join reject" as true)-->output
Source DB-->
The output will be the changed data.
Best regards
Sabrina