Skip to main content

Spark Summit 2017 will be held at Moscone West in San Francisco June 5-7, 2017

Get ready! In less than two weeks, thousands of developers, data scientists, analysts, researchers and business executives from around the world will gather at the Moscone West Convention Center in San Francisco, June 5-7, for the 10th edition of Spark Summit.

Save 15% by using promo code DATABRICKS. Register today.

Co-chaired by Reynold Xin and Edd Wilder-James, Spark Summit 2017 features more than 175 sessions dedicated to all things Apache Spark, with an emphasis on the latest developments in data science, deep learning, machine learning and real-time streaming applications. From deep dive technical tutorials and cutting-edge research projects to real-world case studies, it’s a comprehensive look at how Spark is being used across a variety of industries and applications to solve tough, big data challenges at scale.

Databricks will be making a number of exciting product announcements on the main stage at Spark Summit (wish we could say more now, but you’ll need to wait until the big event). In addition, our team will be delivering several community talks to help you improve your use of Spark and you’ll get to hear first hand from customers that are leveraging Databricks to accelerate business outcomes.

If you haven’t already registered, do so now! Tickets are selling quickly and there’s no better place to meet with, and learn from, key members of the Spark ecosystem.

Here are some of the must-see highlights:

Keynotes and Demos

  • Spark creator and Databricks Chief Technologist Matei Zaharia will kick off Developer Day with a keynote about expanding Apache Spark use cases in 2.2 and beyond, that looks at Databricks’ Structured Streaming API and new machine learning libraries, followed by a demo by Databricks software engineers Tim Hunter and Michael Armbrust.
  • Stanford University associate professor Christopher Re will describe the open source machine learning system Snorkel that aims to make processing Dark Data easier, and will provide a set of tutorials to help you write Snorkel applications that use Spark.
  • Intel VP and GM Michael Greene will discuss BigDL, the recently released open source distributed deep learning framework, and plans for expanding the BigDL ecosystem.
  • O’Reilly Media chief data scientist Ben Lorica will host a fireside chat with Databricks executive chairman and UC Berkeley professor Ion Stoica about Berkeley’s new RISELab, the successor to AMPlab.
  • Riot Games senior data scientist Wes Kerr will share how the gaming company’s Player Behavior Team uses Spark to better understand and combat abusive language.
  • Databricks CEO Ali Ghodsi will share some big news that will be demoed by Databricks engineer Greg Owen.
  • Hotels.com VP and chief data science officer Matt Fryer will talk about the online travel company’s journey to becoming an algorithmic business using Spark.
  • Author and analytics guru Eric Siegel will share insights on how to get predictive analytics right.

Community Talks

  • Apache Spark MLlib’s Past Trajectory and New Directions (Joseph Bradley, Databricks)
  • Easy, Scalable, Fault-Tolerant Stream Processing with Structured Streaming in Apache Spark (Michael Armbrust and Tathagata Das, Databricks)
  • Embracing a Taxonomy of Types to Simplify Machine Learning (Leah McGuire, Salesforce)
  • Herding Cats: Migrating Dozens of Oddball Analytics Systems to Apache Spark (Jon Cavanaugh, HP)
  • Leveraging Spark in an E-commerce Platform to Democratize Data (Shafaq Abdullah, Honest Company)
  • Real-Time Machine Learning Analytics Using Structured Streaming and Kinesis Firehose (Caryl Yuhas and Myles Baker, Databricks)
  • Spark, GraphX and Blockchains: Building a Behavioral Analytics Platform for Forensics, Fraud and Finance (Bryan Cheng & Karen Hsu, BlockCypher)
  • SSR: Structured Streaming on R for Machine Learning (Felix Cheung, Microsoft)

Connect with Databricks and the Spark Community

There will also be office hours with Spark committers, an expanded Expo Hall with dozens of exhibitors, and a ton of networking opportunities -- including an Apache Spark Meetup and the JOIN closing night party which will include food, drinks, games, music and more.

Don't miss the Spark Summit 2017 JOIN closing night party!

Sign Up Today

See the full schedule and sign up now to join us at Spark Summit 2017. Use promo code DATABRICKS to save 15% off.

Try Databricks for free

Related posts

10th Spark Summit Sets Another Record of Attendance

June 9, 2017 by Jules Damji and Wayne Chan in
We have assembled a selected collage of highlights from Databricks’ speakers at our 10th Spark Summit, a milestone for Apache Spark community and...

How to Build a Credit Data Platform on the Databricks Lakehouse

Get started and build a credit data platform for your business by visiting the demo at Databricks Demo Center. Introduction According to the...

Near Real-Time Anomaly Detection with Delta Live Tables and Databricks Machine Learning

Why is Anomaly Detection Important? Whether in retail, finance, cyber security, or any other industry, spotting anomalous behavior as soon as it happens...
See all Company Blog posts