A Production Data Science and
Engineering Platform

Apache Spark optimized
by the team that created it

Provision Spark clusters tuned and managed
by experts in a self-service manner.

Production Jobs
and Workflows

Automatically recover Spark clusters and jobs from failures without human intervention. Easily access detailed logs and debugging information.

Interactive Data Science

Multiply the effectiveness of your data science team with interactive notebooks, multi-user collaboration, GitHub integration,
and much more.

Broadly Compatible

Easily connect BI tools like Tableau via a secure SQL server
or integrate Databricks with your systems via APIs.

Cloud Native

Deploy in your AWS VPC and access your cloud data with
an optimized S3 access layer.

End-to-end Security

The only Spark platform secured by encryption, role-based access control, audit logs, and compliance standards.

We are excited about Databricks as the unified platform for data scientists, engineers, and analysts.

Chris D’Agostino, VP of Technology, Capital One
Viacom
NBC Universal
Capital One
MyFitnessPal
Live Nation

Sophisticated analytics at blazing speed

Community Driven

Benefit from the most active open source project in big data with over 1000+ contributors from 250+ organizations.

Unified Platform

Seamlessly blend ETL, interactive queries, machine learning, and streaming analytics using SQL, Python, Java, R, or Scala.

Rich Libraries

Built-in libraries for machine learning, graph computation, and more to simplify development of data-driven applications.