Integration of AWS Data Pipeline with Databricks: Building ETL pipelines with Apache SparkJanuary 23, 2017 by Peyman Mohajerian in Product This is one of a series of blogs on integrating Databricks with commonly used software packages. See the “What’s Next” section at the...
Real-time Streaming ETL with Structured Streaming in Apache Spark 2.1January 19, 2017 by Tathagata Das, Michael Armbrust and Tyson Condie in Engineering Blog Explore why lakehouses are the data architecture of the future with the father of the data warehouse, Bill Inmon. Try this notebook in...
On-Demand Webinar and FAQ: Apache Spark - The Unified Engine for All WorkloadsJanuary 18, 2017 by Wayne Chan in Company Blog Last week, we held a live webinar — Apache Spark - The Unified Engine for All Workloads — to explain the real-world benefits...
5 Reasons to Attend Spark Summit East 2017January 10, 2017 by Jules Damji in Company Blog Spark Summit East will be held in Boston on Feb 7-9, 2017. Check out the full agenda and get your ticket before it...
5 Can’t Miss Talks at Spark Summit East 2017January 9, 2017 by Wayne Chan in Company Blog If you haven’t been to a Spark Summit yet, you are missing out on the biggest gathering of Apache Spark experts and enthusiasts...
Databricks and Apache Spark 2016 Year in ReviewJanuary 3, 2017 by Reynold Xin, Jules Damji, Dave Wang and Matei Zaharia in Company Blog Spark Summit will be held in Boston on Feb 7-9, 2017. Check out the full agenda and get your ticket before it sells...
Spark Live 2016 Tour RecapJanuary 3, 2017 by Wayne Chan in Company Blog The Apache Spark community had quite the year in 2016. It has maintained its billing as the largest and most active open source...
Top 10 Apache Spark Blog Posts from 2016December 30, 2016 by Jules Damji in Engineering Blog Spark Summit will be held in Dublin, Ireland on Oct 24-26, 2017. Check out the get your ticket before it sells out! Here’s...
Introducing Apache Spark 2.1December 28, 2016 by Reynold Xin in Engineering Blog Spark Summit will be held in Boston on Feb 7-9, 2017. Check out the full agenda and get your ticket before it sells...
10 Things I Wish I Knew Before Using Apache SparkRDecember 28, 2016 by Neil Dewar in Engineering Blog This is a guest post from Neil Dewar , a senior data science manager at a global asset management firm. In this blog...