Master Data Management: Building the Golden Record

In this article, we will learn how to build the “Golden Record” in MDM using Splink, an unsupervised record-linkage model embedded within Databricks ARC.

Manoj Kukreja
AWS in Plain English
6 min read · Feb 11, 2024


Before we run through a sample scenario, let us first understand what the golden record, also known as the single version of truth, means. In a typical organization, the recent upsurge of big data has created a few unique challenges related to data quality:


Author and specialist in Big Data Engineering, Data Science, Data Lakes, Cloud Computing, and IT Security.