Cutting Costs, Not Performance: Optimizing Databricks at Scale
Overview
Experience | In Person |
---|---|
Type | Breakout |
Track | Data Lakehouse Architecture and Implementation |
Industry | Energy and Utilities, Public Sector, Travel and Hospitality |
Technologies | Databricks SQL, Databricks Workflows, Unity Catalog |
Skill Level | Advanced |
Duration | 40 min |
As Databricks transforms data processing, analytics and machine learning, managing platform costs has become crucial for organizations aiming to maximize value while staying within budget. While Databricks offers unmatched scalability and performance, inefficient usage can lead to unexpected cost overruns.
This presentation will explore common challenges organizations face in controlling Databricks costs and provide actionable best practices for optimizing resource allocation, preventing over-provisioning and eliminating underutilization.
Drawing on NTT DATA’s experience, I'll share how we reduced Databricks costs by up to 50% through strategies like choosing the right compute resource, leveraging managed tables and using Unity Catalog features such as system tables to monitor consumption.
Join this session to gain practical insights and tools that will empower your team to optimize Databricks without overspending.
Session Speakers
Pedro Ferreira
Project Manager
NTT DATA
Artur Simões
Lead Engineer
NTT DATA