Apache Spark the Fastest Open Source Engine for Sorting a Petabyte - The Databricks Blog