Skip to main content
Page 1

Writing a Faster Jsonnet Compiler

October 12, 2018 by Li Haoyi, Josh Rosen and Ahir Reddy in
This blog post is part of our series of internal engineering blogs on the Databricks platform, infrastructure management, integration, tooling, monitoring, and provisioning...

Top 5 Reasons for Choosing S3 over HDFS

At Databricks, our engineers guide thousands of organizations to define their big data and cloud strategies. When migrating big data workloads to the...

Introducing Redshift Data Source for Spark

October 19, 2015 by Sameer Wadkar and Josh Rosen in
This is a guest blog from Sameer Wadkar, Big Data Architect/Data Scientist at Axiomine. The Spark SQL Data Sources API was introduced in...

Project Tungsten: Bringing Apache Spark Closer to Bare Metal

April 28, 2015 by Reynold Xin and Josh Rosen in
In a previous blog post , we looked back and surveyed performance improvements made to Apache Spark in the past year. In this...