Apache Spark 2015 Year In ReviewJanuary 4, 2016 by Reynold Xin, Matei Zaharia and Patrick Wendell in Solutions To learn more about Apache Spark, attend Spark Summit East in New York in Feb 2016 . 2015 has been a year of...
Announcing Apache Spark 1.6January 4, 2016 by Michael Lumb, Patrick Wendell and Reynold Xin in Engineering Blog To learn more about Apache Spark, attend Spark Summit East in New York in Feb 2016 . Today we are happy to announce...
Introducing Apache Spark DatasetsJanuary 4, 2016 by Michael Armbrust, Wenchen Fan, Reynold Xin and Matei Zaharia in Engineering Blog Developers have always loved Apache Spark for providing APIs that are simple yet powerful, a combination of traits that makes complex analysis possible...
The Best of The Databricks Blog: Most Read Posts of 2015December 22, 2015 by Dave Wang in Engineering Blog Databricks developers are prolific blog authors when they are not writing code for the Databricks platform or Apache Spark. As 2015 draws to...
Succinct Spark from AMPLab: Queries on Compressed RDDsNovember 10, 2015 by Rachit Agarwal and Anurag Khandelwal in Engineering Blog This is a guest post from Rachit Agarwal and Anurag Khandelwal of the UC Berkeley AMPLab, leads of an ongoing research project called...
Announcing the TFOCS for Spark Optimization PackageNovember 2, 2015 by Aaron Staple in Engineering Blog Aaron is the developer of this Apache Spark package, with support from Databricks. Aaron is a freelance software developer with experience in data...
Generalized Linear Models in SparkR and R Formula Support in MLlibOctober 5, 2015 by Eric Liang in Engineering Blog To get started with SparkR, download Apache Spark 1.5 or sign up for a 14-day free trial of Databricks today . Apache Spark...
Apache Spark 1.5.1 and What do Version Numbers Mean?October 1, 2015 by Reynold Xin in Engineering Blog The inaugural Spark Summit Europe will be held in Amsterdam on October 27 - 29. Check out the full agenda and get your...
Improved Frequent Pattern Mining in Apache Spark 1.5: Association Rules and Sequential PatternsSeptember 28, 2015 by Feynman Liang, Jiajin Zhang and Dandan Tu in Engineering Blog We would like to thank Jiajin Zhang and Dandan Tu from Huawei for contributing to this blog. To get started mining patterns from...
Large Scale Topic Modeling: Improvements to LDA on Apache SparkSeptember 22, 2015 by Feynman Liang, Yuhao Yang and Joseph Bradley in Engineering Blog This blog was written by Feynman Liang and Joseph Bradley from Databricks, and Yuhao Yang from Intel. To get started using LDA, download...