HomepageData + AI Summit 2022 Logo
JUNE 27-30, 2022

June 27–30, 2022


Healthcare and Life Sciences

Register Now

Welcome data teams and executives in the Healthcare and Life Sciences industry! This year’s Data + AI Summit is jam-packed with talks, demos and discussions on the biggest innovations in patient care and drug R&D.

To help you take full advantage of the Healthcare and Life Sciences experience at Summit, we’ve curated all the programs in one place.

Highlights at this year’s Summit:

  • Healthcare and Life Sciences Industry Forum: Our capstone event for Healthcare and Life Sciences attendees at Summit featuring keynotes and panel discussions with Walgreens, Takeda, Optum, and Humana followed by networking. More details in the agenda below. 
  • Healthcare and Life Sciences Lounge: Stop by our industry lounge located outside the Expo floor to meet with Databricks’ industry experts and see solutions from our partners including ZS Associates, John Snow Labs and others.
  • Session Talks: Over 10 technical talks on topics including healthcare NLP, knowledge graphs for R&D, commercial analytics, and predicting hospital readmissions.

The full list of Healthcare and Life Sciences sessions, talks and demos can be found in the agenda below

Featured Speakers

Headshot of Luigi Guadagno

Luigi Guadagno

GVP, Pharmacy Healthcare Technology Platform

Walgreens Boots Alliance

Curated Sessions


10:45 AM

10:45 AM-11:20 AM
Journey to Solving Healthcare Price Transparency with Databricks and Delta Lake
Centers for Medicare & Medicaid Services (CMS) published Price Transparency mandate for health care service providers and payers to adhere to publish the cost of services provided based on procedure codes on public domain. This enabled us to create a comprehensive solution that can process tens of Terabytes data combined to create Machine Readable Files in the form JSON files and host them on public domain....
Headshot of Ross Silberquit
Ross Silberquit
Headshot of Narayanan Hariharasubramanian
Narayanan Hariharasubramanian
See Details
10:45 AM-11:20 AM
Turning Big Biology Data into Insights on Disease – The Power of Circulating Biomarkers
Profiling small molecules in human blood across global populations gives rise to a greater understanding of the varied biological pathways and processes that contribute to human health and diseases. Herein, we describe the development of a comprehensive Human Biology Database, derived from nontargeted molecular profiling of over 300,000 human blood samples from individuals across diverse backgrounds, demogr...
Headshot of Tao Long
Tao Long

11:30 AM

11:30 AM-12:05 PM
The Semantics of Biology—Vaccine and Drug Research with Knowledge Graphs and Logical Inferencing on Apache Spark
From the organization of the tree of life, to the tissues and structures of living organisms: trees and graphs are a recurring data structure in biology. Given the tree-like relationships between biological entities, Knowledge Graphs are emerging as the ideal way to store and retrieve biological data.

In our first Data + AI talk (https://www.youtube.com/watch?v=Kj5bZ2afWSU), we presented the Be...
Headshot of John Hunter
John Hunter
11:30 AM-12:05 PM
Modern Architecture of a Cloud-Enabled Data and Analytics Platform
As part of a strategic decision to adopt cloud and to make data FAIR ( findable, accessible, interoperable, and findable ) the data team at Bayer Enabling Functions decided to build a new data and analytical platform using Data Lakehouse as the core element of that platform. The platform was built to support the creation of Data Products utilizing certain key architectural principles in mind such as providi...
Headshot of Naghman Waheed
Naghman Waheed
Headshot of Dieter Krug
Dieter Krug
11:30 AM-12:05 PM
Lessons Learned from Deidentifying 700 Million Patient Notes
Providence embarked on an ambitious journey to de-identify all our clinical electronic medical record (EMR) data to support medical research and the development of novel treatments. This talk shares how this was done for patient notes and how you can achieve the same.

First, we built a deidentification pipeline using pre-trained deep learning models, fine-tuned to our own data. We then devel...
Headshot of Nadaa Taiyab
Nadaa Taiyab
Headshot of Lindsay Mico
Lindsay Mico
Providence Health

2:05 PM

2:05 PM-2:40 PM
Comprehensive Patient Data Self-Serve Environment and Executive Dashboards Leveraging Databricks and Elasticsearch Processes
Unstructured clinical notes are critical for understanding the complexities of patient response to oncology treatment. As drug labels and diagnostic criteria change over time, combining unstructured clinical documents with EMR and Claims data yields a comprehensive and evolving view of a cancer patient.

At OncoHealth, we leverage tools in Databricks to synthesize these data into a unified data...
2:05 PM-2:40 PM
Accelerating the Pace of Autism Diagnosis with Machine Learning Models
A formal autism diagnosis can be an inefficient and lengthy process. Families may wait months or longer before receiving a diagnosis for their child despite evidence that earlier intervention leads to better treatment outcomes. Digital technologies which detect the presence of behaviors related to autism can scale access to pediatric diagnoses. This work aims to demonstrate the feasibility of deep learning ...
Headshot of Anish Lakkapragada
Anish Lakkapragada
Lynbrook High School / Stanford University

2:50 PM

2:50 PM-3:25 PM
Building Enterprise Scale Data and Analytics Platforms at Amgen
Over the past few years, Amgen have developed a suite of modern enterprise platforms that have served as a core foundational capability for data & analytics transformation for our business functions. We operate in mature agile teams with a dedicated product team for each of our platforms to build reusable capabilities and integrating with business programs in line with SAFe. We have massive business impact ...
Headshot of Deepak Abburi
Deepak Abburi
Headshot of Vickye Jain
Vickye Jain
ZS Associates

4:00 PM

4:00 PM-4:35 PM
Amgen’s Journey To Building a Global 360 View of its Customers with the Lakehouse
Serving patients in over 100 countries, Amgen is a leading global biotech company focused on developing therapies that have the power to save lives. Delivering on this mission requires our commercial teams to regularly meet with healthcare providers to discuss new treatments that can help patients in need. With the onset of the pandemic, where face-to-face interactions with doctors and other Healthcare Prov...
Headshot of Scott Hirayama
Scott Hirayama
Headshot of Bin Yuan
Bin Yuan


3:30 PM

Industry Forum
3:30 PM-5:00 PM
Healthcare and Life Sciences Industry Forum
Join our capstone event for Healthcare and Life Sciences attendees at Summit for keynotes, panel discussions and presentations on the biggest trends in the industry.

Welcome Address: Lakehouse for Healthcare & Life Sciences
• Michael Sanky, Global Industry Lead, Healthcare & Life Sciences, Databricks
Keynote: Journey To Delivering Real-time Pharmacy Insights at Walgreens wit...
Headshot of Michael Sanky
Michael Sanky
Headshot of Miguel Martinez
Miguel Martinez
And six more

5:00 PM

Industry Forum
5:00 PM-6:30 PM
Healthcare and Life Sciences Industry Forum Reception
Join us for a chance to network with other Healthcare and Life Sciences executives. Enjoy some refreshments and learn about our key partners who have built solutions for tackling high impact use cases on the lakehouse.


9:15 AM

9:15 AM-9:50 AM
Predicting Repeat Admissions to Substance Abuse Treatment with Machine Learning
In our presentation, we will walk through a model created to predict repeat admissions to substance abuse treatment centers. The goal is to predict early who will be at high risk for relapse so care can be tailored to put additional focus on these patients. We used the Treatment Episode Data Set (TEDS) Admissions data set, which includes every publicly funded substance abuse treatment admission in the US.
Headshot of Kelsey Emnett
Kelsey Emnett
Kimberly Clark
Headshot of Jennifer Morizzo
Jennifer Morizzo

10:45 AM

10:45 AM-11:20 AM
X-FIPE: eXtended Feature Impact for Prediction Explanation
Many enterprises have built their own machine learning platforms in the cloud using Databricks, e.g. Humana FlorenceAI. In order to effectively drive the adoption of predictive models in daily business operations, data scientists and business teams need to work closely to make sure they serve the consumer needs in compliance with regulatory rules. Model interpretability is key. In this talk, we would like t...
Headshot of Xingde Jiang
Xingde Jiang
Headshot of Steve Brunner
Steve Brunner



Vision AI—Animal Health Industry Use Cases Using Databricks on Azure
Vision AI and Azure Cognitive services can be applied in a variety of ways for healthcare, especially for Animal Health. Animal Diagnostics market size is valued at over USD 4.5 Billion in 2020 and is expected to grow at CAGR of 8.5% from 2021 to 2027(Markets&Markets Study).
The overall livestock advanced monitoring market is expected to grow from USD 1.4 billion in 2021 to USD 2.3 billion by 2026; it...
Headshot of VirooPax Mirji
VirooPax Mirji
Disrupting the Prescription Drug Market with AI and Data
The US prescription drug market has well-known issues; to name a few: overpriced prescription drugs—on average, 2.56 times higher than those in other countries—that many patients cannot afford; complete lack of pricing transparency as the results of convoluted drug distribution systems and having Pharmacy Benefit Managers (PBMs) as the middlemen; razor-thin margins for pharmacies that struggle to stay in bu...
Headshot of Luyuan Fang
Luyuan Fang
Prescryptive Health

Register today for Data + AI Summit to take advantage of all these amazing Healthcare and Life Sciences sessions and networking opportunities.

Register today to save your spot

Register now