SESSION
Data Warehouse Performance on the Data Lakehouse
OVERVIEW
EXPERIENCE | In Person |
---|---|
TYPE | Lightning Talk |
TRACK | Data Lakehouse Architecture |
INDUSTRY | Media and Entertainment |
TECHNOLOGIES | Apache Spark, Delta Lake, SQL Analytics / BI / Visualizations |
SKILL LEVEL | Intermediate |
DURATION | 20 min |
Data lakehouses promise flexibility, scalability, and cost-effectiveness, but they often fail to deliver these benefits because of shortcomings in their query engines. As a result, users have been forced to copy data from the lakehouse into proprietary data warehouses to get the query performance they need. Doing so requires a complex, costly ingestion pipeline that undermines data governance and freshness. In this talk, we will dive into the latest developments in data lakehouse querying and show how you can ensure your data lakehouse realizes its full potential. This talk will cover:
- Why you should avoid using proprietary data warehouses purely for accelerating queries
- The latest technical developments in query engines that improve data lakehouse performance
- Coinbase's data architecture with Databricks Lakehouse and StarRocks