Skip to main content

Spark Summit will be held in Dublin, Ireland on Oct 24-26, 2017. Check out the get your ticket before it sells out!


Here’s our recap of what has transpired with Apache Spark since our previous digest. This digest includes Apache Spark’s top ten 2016 blogs, along with release announcements and other noteworthy events.

Top Ten Apache Spark Blogs

  1. Apache Spark as a Compiler: Joining a Billion Rows per Second on a Laptop
  2. A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets
  3. Introducing Apache Spark Datasets
  4. Introducing GraphFrames
  5. Introducing Apache Spark 2.0
  6. Structured Streaming In Apache Spark
  7. Apache Spark 2.0 Preview: Machine Learning Model Persistence
  8. Apache Spark @Scale: A 60 TB+ production use case from Facebook
  9. Scalable Partition Handling for Cloud-Native Architecture in Apache Spark 2.1
  10. Deep Learning on Databricks

Releases

Webinar

Events

What’s Next

To stay abreast with what’s happening with Apache Spark, follow us on Twitter @databricks and visit SparkHub.

Try Databricks for free

Related posts

10th Spark Summit Sets Another Record of Attendance

June 9, 2017 by Jules Damji and Wayne Chan in
We have assembled a selected collage of highlights from Databricks’ speakers at our 10th Spark Summit, a milestone for Apache Spark community and...

Spark Summit EU 2017 Recap and Reflections

November 5, 2017 by Jules Damji in
“Dublin is now a truly cosmopolitan capital, with an influx of people, energy, and ideas infusing the ever-beguiling, multi-layered city with fresh flavors...

Databricks and Apache Spark 2016 Year in Review

Spark Summit will be held in Boston on Feb 7-9, 2017. Check out the full agenda and get your ticket before it sells...
See all Engineering Blog posts