We have been experiencing performance issue with Compose CDC, as it takes longer time to begin the data load from landing to storage zone. On analysis we could find, as the number of Replicate CDC partition files increase the time taken for Compose CDC also increases. The suggestion that we received was to compress these partition files and merge as a single file. It is also recommended to perform this compression activity atleast bi weekly, so that we have an optimized performance on Compose.
Currently this requires a series of steps to be run manually on each of the Replicate tasks. It will be nice to have this feature automated as a task, similar to Compactor in Compose, so that we are able to a schedule it and save a lot of manual efforts.