Announcing Apache Spark 1.6January 4, 2016 by Michael Lumb, Patrick Wendell and Reynold Xin in Engineering Blog To learn more about Apache Spark, attend Spark Summit East in New York in Feb 2016 . Today we are happy to announce...
Introducing Apache Spark DatasetsJanuary 4, 2016 by Michael Armbrust, Wenchen Fan, Reynold Xin and Matei Zaharia in Engineering Blog Developers have always loved Apache Spark for providing APIs that are simple yet powerful, a combination of traits that makes complex analysis possible...
The Best of The Databricks Blog: Most Read Posts of 2015December 22, 2015 by Dave Wang in Engineering Blog Databricks developers are prolific blog authors when they are not writing code for the Databricks platform or Apache Spark. As 2015 draws to...
Guest Blog: Streamliner - An Open Source Apache Spark Streaming ApplicationDecember 18, 2015 by Ankur Goyal in Company Blog This is a guest blog from Ankur Goyal, VP of Engineering at MemSQL Our always-on interconnected world constantly shuttles data between devices and...
Succinct Spark from AMPLab: Queries on Compressed RDDsNovember 10, 2015 by Rachit Agarwal and Anurag Khandelwal in Engineering Blog This is a guest post from Rachit Agarwal and Anurag Khandelwal of the UC Berkeley AMPLab, leads of an ongoing research project called...
Announcing the TFOCS for Spark Optimization PackageNovember 2, 2015 by Aaron Staple in Engineering Blog Aaron is the developer of this Apache Spark package, with support from Databricks. Aaron is a freelance software developer with experience in data...
Introducing Redshift Data Source for SparkOctober 19, 2015 by Sameer Wadkar and Josh Rosen in Engineering Blog This is a guest blog from Sameer Wadkar, Big Data Architect/Data Scientist at Axiomine. The Spark SQL Data Sources API was introduced in...
Generalized Linear Models in SparkR and R Formula Support in MLlibOctober 5, 2015 by Eric Liang in Engineering Blog To get started with SparkR, download Apache Spark 1.5 or sign up for a 14-day free trial of Databricks today . Apache Spark...
Apache Spark 1.5.1 and What do Version Numbers Mean?October 1, 2015 by Reynold Xin in Engineering Blog The inaugural Spark Summit Europe will be held in Amsterdam on October 27 - 29. Check out the full agenda and get your...
Improved Frequent Pattern Mining in Apache Spark 1.5: Association Rules and Sequential PatternsSeptember 28, 2015 by Feynman Liang, Jiajin Zhang and Dandan Tu in Engineering Blog We would like to thank Jiajin Zhang and Dandan Tu from Huawei for contributing to this blog. To get started mining patterns from...