Showing posts with label Data Quality. Show all posts

Wednesday, March 23, 2011

Automating Data Quality

The summary below was created from a Microsoft webcast.

How do you build a reusable automated data quality solution that can be implemented with minimum cost and effort?

First, ensure management and customer commitment.

Start E A R L Y!

Microsoft Technologies available for ensuring Data Quality


What can be automated?

How can I identify data outliers or even know what the data outliers are?
Problem: We can't write SQL to identify issues we don't know exist.

The Clustering data mining algorithm is implemented in the Data Mining Add-in for Excel. For smaller data sets, this add-in can be used to identify outliers. For more information, go here
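
To make the idea of cluster-based outlier detection concrete, here is a minimal Python sketch. It is not the Microsoft Clustering algorithm or the Excel add-in; it swaps in a plain k-means and then flags any row that sits unusually far from its own cluster centre. The function names (`kmeans`, `find_outliers`), the sample `rows`, the deterministic seeding, and the "3 times the median distance" cutoff are all assumptions made for illustration.

```python
import math
from statistics import median


def kmeans(points, k, iterations=20):
    """Plain k-means over numeric tuples (a stand-in, not the add-in's algorithm)."""
    # Assumption: seed centroids with evenly spaced rows of the sorted input
    # so the sketch is deterministic.
    ordered = sorted(points)
    step = max(1, len(ordered) // k)
    centroids = [ordered[min(i * step, len(ordered) - 1)] for i in range(k)]
    assignment = []
    for _ in range(iterations):
        # Assign every point to its nearest centroid.
        assignment = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Move each centroid to the mean of its members (skip empty clusters).
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = tuple(
                    sum(axis) / len(members) for axis in zip(*members)
                )
    return centroids, assignment


def find_outliers(points, k=2, factor=3.0):
    """Flag points whose distance to their centroid exceeds factor * cluster median."""
    centroids, assignment = kmeans(points, k)
    distances = [math.dist(p, centroids[a]) for p, a in zip(points, assignment)]
    outliers = []
    for c in range(k):
        in_cluster = [d for d, a in zip(distances, assignment) if a == c]
        if not in_cluster:
            continue
        cutoff = factor * median(in_cluster)  # hypothetical threshold
        outliers += [
            p
            for p, d, a in zip(points, distances, assignment)
            if a == c and d > cutoff
        ]
    return outliers


# Made-up sample: two dense groups plus one row that fits neither.
rows = [
    (1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.2), (0.9, 0.8),
    (10.0, 10.0), (10.2, 9.9), (9.8, 10.1), (10.1, 10.2), (9.9, 9.8),
    (50.0, 50.0),
]
outliers = find_outliers(rows, k=2)
print(outliers)
```

The point of the sketch is that no rule about the bad row was written in advance: the row is flagged only because it does not resemble the groups the data forms on its own. On real data the number of clusters and the cutoff would need tuning.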