Is there any known clean-up / data quality functionality in QlikView? Do you have a reference to example clean-up scripts for the data?
The clean-up is for sales / revenue data, including:
* remove duplicates
* find and handle quasi-duplicates
...
More generally, can you point me to macros (Excel or others) or algorithms that perform such clean-up?
Thanks
The best way to get better quality data is to perform cleanups in the source system. Always.
Otherwise you'll build BI solutions full of exception-handling code, which is not a pretty sight and may rapidly become a nightmare to maintain.
However, sometimes you're stuck with data that cannot be changed anymore (general ledgers, transaction tables, historical exchange rates and such), especially in ERP systems. IMHO such situations require ad-hoc solutions, tailored to keep the corrective actions efficient and almost invisible.
Also, some of the code you add to your scripts will not necessarily correct the data, but will bring bad data to the attention of the people responsible. Once they correct their mistakes, that code automatically becomes "inactive" (because there are no more bad values for it to catch).
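To make the idea of surfacing bad data (rather than silently fixing it) concrete, here is a minimal sketch in Python, not QlikView script. The field names, the `KNOWN_CURRENCIES` list and the rule that a negative amount is suspect are all assumptions for illustration; real rules depend on your data (credit notes, for instance, may legitimately be negative).

```python
from dataclasses import dataclass


@dataclass
class SalesRow:
    # Hypothetical layout of a sales record; adjust to your own fields.
    order_id: str
    customer: str
    amount: float
    currency: str


# Assumption: an illustrative whitelist, not a complete currency list.
KNOWN_CURRENCIES = {"EUR", "USD", "GBP"}


def find_issues(row):
    """Return a list of human-readable problems found in one row."""
    issues = []
    if not row.customer.strip():
        issues.append("missing customer")
    if row.amount < 0:
        issues.append("negative amount")
    if row.currency not in KNOWN_CURRENCIES:
        issues.append("unknown currency")
    return issues


def exception_report(rows):
    """List (order_id, issues) for every row that has at least one issue.

    The data itself is left untouched; the report is what you hand to the
    people responsible. Once they fix the source, the report comes back empty.
    """
    report = []
    for row in rows:
        issues = find_issues(row)
        if issues:
            report.append((row.order_id, issues))
    return report


rows = [
    SalesRow("A1", "Acme", 100.0, "EUR"),
    SalesRow("A2", "", 50.0, "USD"),
    SalesRow("A3", "Globex", -10.0, "XXX"),
]
report = exception_report(rows)
for order_id, issues in report:
    print(order_id, "->", ", ".join(issues))
```

In a QlikView app the same idea would typically end up as a flag field or a separate exceptions table that users can select on, but that part is not shown here.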
Some examples:
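For the duplicate handling asked about in the question, one possible sketch in Python (again, not QlikView script) is below. The helper names `normalize`, `remove_exact_duplicates` and `find_quasi_duplicates`, the sample records and the 0.85 similarity threshold are hypothetical choices for illustration; `difflib.SequenceMatcher` comes from the Python standard library. Treat it as a starting point under those assumptions, not a definitive matching method.

```python
from difflib import SequenceMatcher


def normalize(name):
    """Lowercase, drop punctuation and collapse whitespace before comparing."""
    cleaned = "".join(ch for ch in name.lower() if ch.isalnum() or ch.isspace())
    return " ".join(cleaned.split())


def remove_exact_duplicates(records):
    """Keep the first occurrence of each record, preserving input order."""
    seen = set()
    unique = []
    for rec in records:
        if rec not in seen:
            seen.add(rec)
            unique.append(rec)
    return unique


def find_quasi_duplicates(names, threshold=0.85):
    """Return pairs of names whose normalized forms are at least `threshold` similar.

    Pairs are only reported, not merged: deciding which spelling wins is
    usually a business decision. Comparison is pairwise, so this is only
    practical for modest list sizes.
    """
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            ratio = SequenceMatcher(
                None, normalize(names[i]), normalize(names[j])
            ).ratio()
            if ratio >= threshold:
                pairs.append((names[i], names[j]))
    return pairs


# Hypothetical sample data: (order_id, customer, amount)
records = [
    ("A1", "Acme Corp.", 100.0),
    ("A1", "Acme Corp.", 100.0),
    ("A2", "Globex", 50.0),
]
unique_records = remove_exact_duplicates(records)

customers = ["Acme Corp.", "ACME  Corp", "Globex"]
suspects = find_quasi_duplicates(customers)
print(unique_records)
print(suspects)
```

The threshold is worth tuning against a sample of your own customer or product names: set it too low and unrelated names get paired, too high and obvious variants slip through.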
Best,
Peter
Please post some sample data so the community can help you further.