There’s never been a more important moment for data teams. Together, we can solve the world’s toughest problems — and it starts with Spark + AI Summit. We’ve transformed this year’s Summit into a global event — totally virtual and open to everyone, free of charge. And Summit is now even bigger: extended to five days with 200+ sessions, 4x the training, and keynotes by visionaries and thought leaders. Join tens of thousands of engineers, scientists, developers, analysts and leaders as we shape the future of big data, analytics and AI.
Associate Provost, Division of Computing, Data Science, and Society (CDSS)
Maintainer of PyTorch
Creator of Keras
West Coast Head of Engineering of FAIR
Digital Forensics Pioneer
Co-founder & CEO
Original Creator of Apache Spark™
Co-founder & Chief Technologist, Databricks
Original Creator of Apache Spark™ & MLflow
Machine Learning Practice Lead
Director of Product Management
Principal Product Architect
Corporate Vice President, Azure Data
Director of Data and Analytics Engineering
Spark + AI Summit 2020 training takes place on June 22-23, with an expanded curriculum of half-day and all-day classes. These training classes include both lectures and hands-on exercises. An Apache Spark™ 2.x certification exam is also offered, with an optional half-day prep course.
Conveniently located in the South of Market area, Moscone West provides easy access to downtown San Francisco’s many hotels and restaurants, giving attendees the opportunity to enjoy the city after the sessions close. Take advantage of easy transportation via BART, Muni and Caltrain.
Apache Spark™ is a powerful open-source processing engine built around speed, ease of use, and sophisticated analytics. Spark began at UC Berkeley in 2009, and it is now developed at the vendor-independent Apache Software Foundation. Since its initial release, Spark has seen rapid adoption by enterprises across wide-ranging industries. Internet powerhouses such as Facebook, Hotels.com, Cisco, Microsoft, and Netflix have deployed Spark at massive scale, processing multiple petabytes of data on clusters of more than 8,000 nodes. Apache Spark™ has also become the largest open-source community in big data, with more than 1,000 contributors from over 250 organizations.
Apache Spark™ Developers
Data and ML Engineers
Infrastructure / Site Reliability Engineers
Key Decision Makers
Data and AI need to be unified: the best AI applications require massive amounts of constantly updated training data to build state-of-the-art models. Apache Spark™ is the only unified analytics engine that combines large-scale data processing with state-of-the-art machine learning and AI algorithms.
Combining Spark + AI topics, this five-day virtual conference delivers a one-stop shop for developers, data scientists and tech executives seeking to apply the best tools in data and AI to build innovative products. Join tens of thousands of engineers, data scientists, AI experts, researchers and business professionals for five days of in-depth learning and networking.
Sessions and training will cover data engineering and data science content, along with best practices for productionizing AI: keeping training data fresh with stream processing, monitoring quality, testing, and serving models at massive scale. The conference will also include deep-dive sessions on popular software frameworks like Delta Lake, MLflow, TensorFlow, scikit-learn, Keras, PyTorch, DeepLearning4J, BigDL and deep learning pipelines.