Databricks Bi-Weekly Digest: 8/31/16August 31, 2016 by Jules Damji in Engineering Blog Here’s our recap of what’s transpired with Apache Spark since our previous digest . Databricks CTO and Co-founder Matei Zaharia presented “Unifying big...
How to use SparkSession in Apache Spark 2.0August 15, 2016 by Jules Damji in Engineering Blog Generally, a session is an interaction between two or more entities. In computer parlance, its usage is prominent in the realm of networked...
Databricks Bi-Weekly Digest: 8/8/16August 8, 2016 by Jules Damji in Engineering Blog Continuing with our bi-weekly digest series, here’s our recap of what’s transpired over the last two weeks with Apache Spark since our previous...
Spark Structured StreamingJuly 28, 2016 by Matei Zaharia, Tathagata Das, Michael Lumb and Reynold Xin in Engineering Blog Apache Spark 2.0 adds the first version of a new higher-level API, Structured Streaming, for building continuous applications . The main goal is...
Continuous Applications: Evolving Streaming in Apache Spark 2.0July 28, 2016 by Matei Zaharia in Engineering Blog Since its release, Spark Streaming has become one of the most widely used distributed streaming engines, thanks to its high-level API and exactly-once...
Introducing Apache Spark 2.0July 26, 2016 by Reynold Xin, Michael Lumb and Matei Zaharia in Engineering Blog Today, we're excited to announce the general availability of Apache Spark 2.0 on Databricks. This release builds on what the community has learned...
Databricks Bi-Weekly Digest: 7/18/16July 18, 2016 by Jules Damji in Engineering Blog Today, we're kicking off a new series: the Databricks Bi-Weekly Digest. Our goal with this digest is to summarize Spark related content, compiled...
A Tale of Three Apache Spark APIs: RDDs vs DataFrames and DatasetsJuly 14, 2016 by Jules Damji in Engineering Blog Of all the developers' delight, none is more attractive than a set of APIs that make developers productive, that is easy to use...
SparkR Tutorial at useR 2016July 7, 2016 by Hossein Falaki and Shivaram Venkataraman in Solutions AMPLab and Databricks gave a tutorial on SparkR at the useR conference. The conference was held from June 27 - June 30 at...
Apache Spark Key Terms, ExplainedJune 22, 2016 by Jules Damji and Denny Lee in Engineering Blog This article was originally posted on KDnuggets The Spark Summit Europe call for presentations is open, submit your idea today As observed in...