Spark Survey 2015 Results are now available - September 24, 2015 by Matei Zaharia, Patrick Wendell and Denny Lee in Announcements - We ran the Spark Survey 2015 this summer to gain insights on how organizations are using Apache Spark. The results of this year’s...
Easier Spark Code Debugging - September 23, 2015 by Chaoyu Yang in Product - To try the features mentioned in this blog, sign up for a 14-day free trial of Databricks today. We are excited to...
Large Scale Topic Modeling: Improvements to LDA on Apache Spark - September 22, 2015 by Feynman Liang, Yuhao Yang and Joseph Bradley in Engineering Blog - This blog was written by Feynman Liang and Joseph Bradley from Databricks, and Yuhao Yang from Intel. To get started using LDA, download...
Apache Spark 1.5 DataFrame API Highlights: Date/Time/String Handling, Time Intervals, and UDAFs - September 16, 2015 by Michael Armbrust, Yin Huai, Davies Liu and Reynold Xin in Engineering Blog - To try the new features highlighted in this blog post, download Spark 1.5 or sign up for a 14-day free trial of Databricks today...
Announcing Apache Spark 1.5 - September 8, 2015 by Reynold Xin and Patrick Wendell in Engineering Blog - The inaugural Spark Summit Europe will be held in Amsterdam this October. Check out the full agenda and get your ticket before it...
Spark Summit Europe Full Agenda Available Online - August 31, 2015 by Scott Walent and Denny Lee in Company Blog - This October, join the Apache Spark community in Amsterdam at the Beurs Van Berlage for the very first Spark Summit in Europe! We...
Apache Spark 1.5 Preview Now Available in Databricks - August 18, 2015 by Reynold Xin and Michael Lumb in Product - We are excited to announce that starting today, Apache Spark 1.5.0 is available as a preview in Databricks. Our users can now choose...
From Pandas to Apache Spark's DataFrame - August 12, 2015 by Olivier Girardot in Engineering Blog - This is a cross-post from the blog of Olivier Girardot. Olivier is a software engineer and the co-founder of Lateral Thoughts, where he...
Helping the Democratization of Big Data - August 5, 2015 by Ali Ghodsi in Company Blog - When we started Databricks, we thought that extracting insights from big data was insanely difficult for no good reason. You almost needed an...
Guest blog: SequoiaDB Connector for Apache Spark - August 3, 2015 by Tao Wang in Company Blog - This is a guest blog from Tao Wang at SequoiaDB. He is the co-founder and CTO of SequoiaDB, leading its long-term technology...