Do not input private or sensitive data. View Qlik Privacy & Cookie Policy.
Skip to main content

Announcements
Streamlining user types in Qlik Cloud capacity-based subscriptions: Read the Details
cancel
Showing results for 
Search instead for 
Did you mean: 
kwlee
Contributor II
Contributor II

How to continue replicating CDC to target after removal of source table in task and subsequently adding it back?

Hi, I have a task replicating 5 tables, sending to Event hub for downstream processing into target database.

Task setting as below: 

Full load processing is ON
Change processing: Apply Changes Processing is ON.

I have removed a table from the task at 3pm a day ago. Target table and data still exists in target, as at 3pm.

Now I need to add back the table to the task and resume CDC from 3pm yesterday. I do not want a reload of the re-added table to happen. And the other 4 tables should resume CDC when the task is started back.

Is below the correct way to do it? 

1. Stop task
2. Select the removed table from source. Apply transformation as required.
3. From Run button, select Advanced Run Options
4. Select " Tables are already loaded. Start processing changes from 'yesterday 2:55pm'

Labels (2)
3 Solutions

Accepted Solutions
eva55anderson
Contributor
Contributor

The correct way to achieve this is to add the table back to the task, and instead of choosing "Advanced Run Options," you would set the "Start processing changes from" to "yesterday 2:55pm." This will resume CDC from the specified time for the re-added Milestone Mastercard table without triggering a full reload, while the other tables will also resume their CDC processing.

 

View solution in original post

Dana_Baldwin
Support
Support

Hi @kwlee 

You are correct. The thing to note is that since the other tables in the task continued processing changes after you removed the one, when you start the task from timestamp after adding it back in you may get PK constraint violation errors & etc. on the tables that continued processing changes since yesterday. This error handing may slow performance until the task catches up.

Thanks,

Dana

View solution in original post

lisefipumom
Contributor
Contributor

Oh, I get what you’re trying to do here! So you want to add the table back to the task without triggering a full reload and without affecting the other tables that should continue with CDC. Based on your steps, here’s my take on it:

Stop the task – This makes sense to stop everything before making changes.

Select the removed table from the source – Yes, you should add the table back and apply any necessary transformations to match your setup.

Use Advanced Run Options – This is the key part. Selecting the option to start processing changes from "yesterday 2:55pm" seems like the right approach to avoid a reload. That will ensure CDC picks up from the last change without overwriting data.

Don’t forget to ensure CDC for the other 4 tables continues – When you restart the task, it should pick up where it left off for the other tables.

So yes, your approach looks good to me! Just double-check the timestamp and make sure the settings for CDC are aligned to ensure you’re resuming from the right point. It's like adjusting the flow in a system—small tweaks, but critical to keeping everything running smoothly without any hiccups!

View solution in original post

6 Replies
eva55anderson
Contributor
Contributor

The correct way to achieve this is to add the table back to the task, and instead of choosing "Advanced Run Options," you would set the "Start processing changes from" to "yesterday 2:55pm." This will resume CDC from the specified time for the re-added Milestone Mastercard table without triggering a full reload, while the other tables will also resume their CDC processing.

 

Dana_Baldwin
Support
Support

Hi @kwlee 

You are correct. The thing to note is that since the other tables in the task continued processing changes after you removed the one, when you start the task from timestamp after adding it back in you may get PK constraint violation errors & etc. on the tables that continued processing changes since yesterday. This error handing may slow performance until the task catches up.

Thanks,

Dana

lisefipumom
Contributor
Contributor

Oh, I get what you’re trying to do here! So you want to add the table back to the task without triggering a full reload and without affecting the other tables that should continue with CDC. Based on your steps, here’s my take on it:

Stop the task – This makes sense to stop everything before making changes.

Select the removed table from the source – Yes, you should add the table back and apply any necessary transformations to match your setup.

Use Advanced Run Options – This is the key part. Selecting the option to start processing changes from "yesterday 2:55pm" seems like the right approach to avoid a reload. That will ensure CDC picks up from the last change without overwriting data.

Don’t forget to ensure CDC for the other 4 tables continues – When you restart the task, it should pick up where it left off for the other tables.

So yes, your approach looks good to me! Just double-check the timestamp and make sure the settings for CDC are aligned to ensure you’re resuming from the right point. It's like adjusting the flow in a system—small tweaks, but critical to keeping everything running smoothly without any hiccups!

kwlee
Contributor II
Contributor II
Author

Thank you Eva55anderson.

kwlee
Contributor II
Contributor II
Author

Thank you Dana_Baldwin.

kwlee
Contributor II
Contributor II
Author

Thank you lisefipumom