Master Data Management: Building the Golden Record
In this article, we will learn how to build the “Golden Record” in MDM using Splink, an unsupervised linking model, embedded within Databricks ARC.
6 min read · Feb 11, 2024
Before we run through a sample scenario, let us understand what the golden record, also known as the single version of truth, means. In a typical organization, the recent surge in big data has created a few distinct challenges related to data quality: