
Spark Summit will be held in Dublin, Ireland on Oct 24-26, 2017. Get your ticket before it sells out!


Here’s our recap of what has happened with Apache Spark since our previous digest. This digest includes the top ten Apache Spark blog posts of 2016, along with release announcements and other noteworthy events.

Top Ten Apache Spark Blogs

  1. Apache Spark as a Compiler: Joining a Billion Rows per Second on a Laptop
  2. A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets
  3. Introducing Apache Spark Datasets
  4. Introducing GraphFrames
  5. Introducing Apache Spark 2.0
  6. Structured Streaming In Apache Spark
  7. Apache Spark 2.0 Preview: Machine Learning Model Persistence
  8. Apache Spark @Scale: A 60 TB+ production use case from Facebook
  9. Scalable Partition Handling for Cloud-Native Architecture in Apache Spark 2.1
  10. Deep Learning on Databricks

Releases

Webinar

Events

What’s Next

To stay abreast of what’s happening with Apache Spark, follow us on Twitter @databricks and visit SparkHub.