Compose for DW: Update all the stats on DW and DM tables
Currently Compose does not have a mechanism to provide its customers the options on how the statistics should be collected on each of the tables generated via Compose (Hubs, Satellites, Facts and Dimensions).
Providing this options would help to configure, update different statistics on each of the tables created via Compose.
Updating all the statistics on the table would not only help in performance improvement during loads but will also help in efficient query optimization as part of data virtualization or visualization.
I would add to this and request that the TSTG_ tables be analyzed. Since the TSTG tables are truncated as opposed to dropped, stats are gathered when there are 0 rows in the table and are not updated when they are populated with data.
I set this to partially delivered due to the common related to Oracle.
(The request did not specify a DW platform)
For Azure Synapse - creation and update of statistics for DWH and DM tables has been delivered fully with stats being created on full load, and updated with sampling based on MSFT best practices for incremental load.
For this ideation request, can you please help me with the exact process step number in the DWH and DM task statements/instructions where these are implemented?
I believe these are implemented for Hubs, Satellites, Dimensions and Facts. So for every table in this category, there should be a process number or process step assigned to them in the set of instructions for the DWH or DM Task statements which will update statistics after the table is loaded.
I'm looking for those process steps so I can verify them in the instructions post generation.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.