Delta and Databricks as a Cost-Effective, Exabyte-Scale, Real-Time Web Application Backend
Overview
Experience | In Person |
---|---|
Type | Lightning Talk |
Track | Data Lakehouse Architecture and Implementation |
Industry | Enterprise Technology, Financial Services |
Technologies | Apache Spark, Delta Lake |
Skill Level | Intermediate |
Duration | 20 min |
The Delta Lake architecture promises to provide a single, highly functional, and high-scale copy of data that can be leveraged by a variety of tools to satisfy a broad range of use cases. To date, most use cases have focused on interactive data warehousing, ETL, model training, and streaming. Real-time access is generally delegated to costly and sometimes difficult-to-scale NoSQL, indexed storage, and domain-specific specialty solutions, which provide limited functionality compared to Spark on Delta Lake.
In this session, we will explore the Delta data-skipping and optimization model and discuss how Capital One leveraged it along with Databricks photon and Spark Connect to implement a real-time web application backend. We’ll share how we built a highly-functional and performant security information and event management user experience (SIEM UX) that is cost effective.
Session Speakers
IMAGE COMING SOON
Scott Schenkein
/VP, Distinguished Engineer
Capital One Financial