Why Databricks?

The complete solution for data scientists and engineers.

Effortlessly manage large-scale Spark clusters

Spin up and scale out clusters to hundreds of nodes and beyond with just a few clicks, without IT or DevOps. Easily harness the power of Spark for streaming, machine learning, graph processing, and more.

Accelerate your work with an interactive workspace

Work interactively while automatically documenting your progress in notebooks written in R, Python, Scala, or SQL. Visualize data in just a few clicks, and use familiar tools like matplotlib, ggplot, or d3.

Run your production jobs at scale

Put new applications in production with one click by scheduling either notebooks or JARs. Monitor the progress of production jobs and set up automated alerts to notify you of changes.

Collaborate interactively

Seamlessly share notebooks, collaborate in the same code base, comment on each other’s work, and track activities.

Publish your analysis with customized dashboards

Build dashboards that present your findings in a few clicks, and set them up to update automatically through jobs.

Connect your favorite apps

Run your favorite BI tools or sophisticated third-party applications on Databricks.

Apache Spark

Apache Spark: sophisticated analytics at blazing speed.

Unified Platform

Seamlessly blend ETL, interactive queries, machine learning and streaming analytics using SQL, Python, Java, R, or Scala.

Community-Driven

Benefit from the most active open source project in big data, with 500+ contributors from 200+ organizations.

Rich Libraries

Built-in libraries for machine learning, graph computation, and more to simplify development of data-driven applications.

We’re hiring

Join Databricks to work on some of the world’s most challenging big data problems.
