Session

Entity Resolution for the Best Outcomes on Your Data

Overview

ExperienceIn Person
TypeBreakout
TrackArtificial Intelligence
IndustryHealth and Life Sciences, Manufacturing
TechnologiesMLFlow, Mosaic AI, Databricks Apps
Skill LevelIntermediate
Duration40 min

There are many ways to implement entity resolution (ER) system — both using vendor software and open-source libraries that enable DIY Entity Resolution. However, generally we see common challenges with any approach — scalability, bound to a single model architecture, lack of metrics and explainability, and stagnant implementations that do not "learn" with experience. Recent experiments with transformer-based approaches, fast lookups with vector search and Databricks components such as Databricks Apps and Agent Eval provide the foundations for a composable ER system that can get better with time on your data. In this presentation, we include a demo of how to use these components to build a composable ER that has the best outcomes for your data.

Session Speakers

IMAGE COMING SOON

Yinxi Zhang

/Staff Data Scientist
Databricks

IMAGE COMING SOON

Ninad Sohoni

/DSA
Databricks