Understanding and mastering the various Microsoft SQL Server Tools available for Data Quality , Master Data Management , Fuzzy Matching and Enterprise Information Management can be daunting and exasperating. In the hopes of reducing the anxiety and frustration I am providing a practical roadmap using openly available resources and requiring only a few days to complete.
I have reviewed and organized several articles and blog post that would allow you within a few days to master the skills required to utilize several SQL Server 2012 capabilities mentioned above.
From a business perspective where to be covering the following:
Master Data Management (MDS)
Data Quality (DQS)
Fuzzy Matching(data Deduplication)
Data Profiling (Source analysis against business rules)
Master Data Services
Nick Barclay: BI-Lingual: MDS Architecture Notes
This article will provide overview of the MDS architecture in support of MDM (Master Data Management)
Nick Barclay: BI-Lingual: Beginning Master Data Services (Part 1 thru 7)
This article will teach you the steps for implementing Microsoft MDS as well as the tasks required to develop and load a Model. Complete all exercises in order. MDS is the tool Microsoft provide to support creating and maintaining reference data set (Lookup and Code tables) in support of Master Data Management.
Data Quality Services
Enterprise Information Management using SSIS, MDS, and DQS Together [Tutorial]
This next article will teach you how to implement and utilize the Data Quality cleansing and matching capabilities.
Fuzzy Matching and Deduplication
Advanced SSIS Fuzzy Matching via Record Linkage Methodology – SQLServerCentral
This article will teach you the concepts and methodology recommended for Fuzzy Matching or Deduplication, in the context of a well known Record Linkage Methodology.
Advanced Matching and Data Profiling
I have also included the next two articles to further explore the code and capabilities to solve complex matching and deduplication efforts
Roll Your Own Fuzzy Match / Grouping (Jaro Winkler) – T-SQL – SQLServerCentral
Roll Your Own SSIS Fuzzy Matching / Grouping SSIS (Jaro – Winkler) – SQLServerCentral
Creating a Metadata Mart via TSQL – Complete Data Profiling Kit – Download
Great list, thanks a lot for posting it. I found this video and the author’s blog very informative as an introduction to data profiling. http://technet.microsoft.com/en-us/sqlserver/ff686909.aspx
Oracle has a good article too, though it is of course product specific and from ‘the other team’, but it didn’t hurt reading it and I learned a lot from it too .
thank you for your work