NOW ON-DEMAND

The Virtual Event for Data Teams
June 22-26, 2020

*Open until July 3

Data Teams Unite!

There’s never been a more important moment for data teams. Together, we can solve the world’s toughest problems — and it starts with Spark + AI Summit. We’ve transformed this year’s Summit into a global event — totally virtual and open to everyone, free of charge. And Summit is now even bigger: extended to five days with 200+ sessions, 4x the training, and keynotes by visionaries and thought leaders. Join tens of thousands of engineers, scientists, developers, analysts and leaders as we shape the future of big data, analytics and AI.

 

Spark + AI Summit 2020 Keynote Speakers

Nate Silver

Founder, FiveThirtyEight.com

Prof. Jennifer Chayes

Associate Provost, Division of Computing, Data Science, and Society (CDSS)

Adam Paszke

Maintainer of PyTorch

Kim Hazelwood

West Coast Head of Engineering of FAIR

Dr. Phillip Atiba Goff

Co-Founder and President

Prof. Hany Farid

Digital Forensics Pioneer

Ali Ghodsi

Co-founder & CEO
Original Creator of Apache Spark™

Matei Zaharia

Co-founder & Chief Technologist, Databricks
Original Creator of Apache Spark™ & MLflow

Brooke Wenig

Machine Learning Practice Lead

Clemens Mewald

Director of Product Management

Amy Heineike

Principal Product Architect

Rohan Kumar

Corporate Vice President, Azure Data

Vish Subramanian

Director of Data and Analytics Engineering

Sue Ann Hong

Software Engineer

Lauren Richie

Software Engineer

Reynold Xin

Co-founder & Chief Architect

Sarah Bird

Principal Program Manager

Leide Cabral

Diversity and Inclusion Program Manager

Anurag Sehgal

Managing Director, Credit Suisse Global Markets
Credit Suisse

Training & Certification

About:

Spark + AI Summit 2020 training takes place June 22-23, with an expanded curriculum of half-day and all-day classes. Training classes include both lectures and hands-on exercises. Apache Spark™ 2.x certification is also offered as an exam, with an optional half-day prep course.

Certification:

  • Half-day Prep course + Apache Spark 2.x Certification Exam
  • Databricks Certification Exams

Training:

  • Find out what lies ahead for popular open-source projects – Apache Spark™, Delta Lake, MLflow and Koalas
  • Learn real-world Artificial Intelligence use cases from leading companies
  • Discover best practices for building, deploying and productionizing ML models
  • Learn about the latest deep learning frameworks and Apache Spark integrations
  • Hear how to improve performance and memory optimization from Spark committers
  • Learn how to leverage Structured Streaming in your ETL and real-time analytics
  • Get insight into how unifying data and AI drives business innovation

Conference Pass

General: FREE
Access to sessions, keynotes, and virtual events. Pre-conference training not included.

VIP: $99
Access to sessions, keynotes, and virtual events, plus exclusive perks such as:
  • AMA (Ask Me Anything) sessions with Spark committers
  • Priority appointments at the Advisory Bar, where you can book 1:1 meetings with subject matter experts
  • Exclusive content and sessions just for VIPs (specific sessions TBA)
  • Swag bag delivery
  • Bonus points in the virtual game at our booth
Pre-conference training not included. Limited availability.

Training Pass

Training courses are sold separately. Training-only passes are available.

Half-Day Courses: $200

Full-Day Courses: $400
These full-day courses will be broken into 2 half-day classes. Details on courses will be sent to you directly.

Group Pricing: $160 (half-day) / $320 (full-day)
Groups of 4 or more get 20% off. Please contact registration@spark-summit.org for more information.

Certification

Half-Day Prep Course + Apache Spark™ Certification Exam: $200

Apache Spark™ Certification Exam: $200

VENUE

MOSCONE WEST CONVENTION CENTER

Conveniently located in the South of Market area, Moscone West offers easy access to downtown San Francisco’s many hotels and restaurants, so you can enjoy the city after the sessions close. Take advantage of easy transportation via BART, MUNI and Caltrain.

LEARN MORE + SEE HOTEL AND AIRFARE, CAR RENTAL DEALS

Apache Spark™ is a powerful open-source processing engine built around speed, ease of use, and sophisticated analytics. Spark began at UC Berkeley in 2009, and it is now developed at the vendor-independent Apache Software Foundation. Since its initial release, Spark has seen rapid adoption by enterprises across wide-ranging industries. Internet powerhouses such as Facebook, Hotels.com, Cisco, Microsoft, and Netflix have deployed Spark at massive scale, processing multiple petabytes of data on clusters of more than 8,000 nodes. Apache Spark™ has also become the largest open-source community in big data, with more than 1,000 contributors from over 250 organizations.

Why Attend Spark + AI Summit?

Reasons To Attend

  • Discover real-world use cases for AI
  • Learn how to build reliable and fast data lakes with Delta Lake and Apache Spark
  • Stay abreast of best practices for productionizing the machine learning lifecycle
  • Learn about the latest deep learning frameworks and Apache Spark integrations
  • Find out what lies ahead for the open source Spark project
  • Hear how to improve performance and memory optimization from Spark committers
  • Learn how to leverage Structured Streaming in your ETL and real-time analytics
  • Get tips and tools to process big data more quickly and efficiently from leading data scientists and researchers
  • See how leading companies successfully deploy Apache Spark at scale
  • Find out how real-world AI use cases are driving innovation in business products
  • Get insight into how unifying data and AI drives business innovation
  • Learn how Spark is employed in a variety of enterprise applications
  • Hear how other enterprise Spark users solve business problems

Who Will Attend

Apache Spark™ Developers
Data and ML Engineers
Data Scientists
Infrastructure / Site Reliability Engineers
Researchers
Data Practitioners
Key Decision Makers
Business Executives
 

The World's Largest Gathering of Data Teams For the Apache Spark Community

Spark + AI Summit 2020

Data and AI need to be unified: the best AI applications require massive amounts of constantly updated training data to build state-of-the-art models. Apache Spark™ is the only unified analytics engine that combines large-scale data processing with state-of-the-art machine learning and AI algorithms.

Combining Spark + AI topics, this five-day virtual conference delivers a one-stop shop for developers, data scientists and tech executives seeking to apply the best tools in data and AI to build innovative products. Join tens of thousands of engineers, data scientists, AI experts, researchers and business professionals for five days of in-depth learning and networking.

Sessions and training will cover data engineering and data science content, along with best practices for productionizing AI: keeping training data fresh with stream processing, quality monitoring, testing, and serving models at massive scale. The conference will also include deep-dive sessions on popular software frameworks like Delta Lake, MLflow, TensorFlow, scikit-learn, Keras, PyTorch, DeepLearning4J, BigDL and Deep Learning Pipelines.

 

Learn About:

  • What’s coming next in Apache Spark™, Delta Lake, MLflow, and Koalas
  • Best practices for managing the machine learning lifecycle
  • Tips for building reliable data pipelines at scale
  • Latest developments in popular deep learning and machine learning frameworks
  • Practical, real-world use cases for AI

Tracks – Personalize Your Experience:

  • AI
  • Data Science
  • Deep Learning Techniques
  • Productionizing ML
  • Developer
  • Enterprise
  • Python & Advanced Analytics
  • Research
  • Technical Deep Dives
  • Apache Spark™ Use Cases & Ecosystem