Session

Trust You Can Measure: Data Quality Standards in the Lakehouse

Overview

Experience: In Person
Type: Breakout
Track: Data and AI Governance
Industry: Enterprise Technology, Professional Services
Technologies: Apache Spark, Unity Catalog
Skill Level: Beginner
Duration: 40 min

Do you trust your data? If you’ve ever struggled to figure out which datasets are reliable, well-governed, or safe to use, you’re not alone. At Databricks, our own internal lakehouse faced the same challenge: hundreds of thousands of tables, but no easy way to tell which ones met quality standards. In this talk, the Databricks Data Platform team shares how we tackled the problem by building the Data Governance Score, a way to systematically measure and surface trust signals across the entire lakehouse. You’ll learn how we use Unity Catalog, governed tags, and enforcement to drive better data decisions at scale. Whether you’re a data engineer, platform owner, or business leader, you’ll leave with practical ideas for raising the bar on data quality and trust in your own data ecosystem.

Session Speakers

Amit Pahwa

Staff Software Engineer
Databricks

Sergiy Kanyshchev

Staff Software Engineer
Databricks