Top 10 Apache Spark Blog Posts from 2016

Engineering blog
Updated: April 15, 2024
Published: December 30, 2016
Open Source

Spark Summit will be held in Dublin, Ireland on Oct 24-26, 2017. Get your ticket before it sells out!


Here’s our recap of what has transpired with Apache Spark since our previous digest. This digest includes the top ten Apache Spark blog posts of 2016, along with release announcements and other noteworthy events.

Top Ten Apache Spark Blogs

  1. Apache Spark as a Compiler: Joining a Billion Rows per Second on a Laptop
  2. A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets
  3. Introducing Apache Spark Datasets
  4. Introducing GraphFrames
  5. Introducing Apache Spark 2.0
  6. Structured Streaming In Apache Spark
  7. Apache Spark 2.0 Preview: Machine Learning Model Persistence
  8. Apache Spark @Scale: A 60 TB+ production use case from Facebook
  9. Scalable Partition Handling for Cloud-Native Architecture in Apache Spark 2.1
  10. Deep Learning on Databricks

Releases

Webinar

Events

What’s Next

To stay abreast of what’s happening with Apache Spark, follow us on Twitter @databricks and visit SparkHub.
