Join over 7,000 data scientists, engineers and analysts to collaborate at the intersection of data and ML. You will learn about the latest advances in open-source technologies such as Apache Spark™ , Delta Lake, MLflow, TensorFlow, and PyTorch as well as best practices for deploying AI in the real world. You will also learn the latest on bleeding edge OSS technologies including Delta Lake, MLflow, and Koalas.
Author, The Signal And The Noise
Associate Provost of Data Science and Information, and Dean of the School of Information at UC Berkeley
Author and Maintainer, PyTorch
Co-founder & CEO
Original Creator of Apache Spark
Co-founder & Chief Technologist, Databricks Original Creator of Apache Spark™ & MLflow
Co-founder & Chief Architect, Databricks
Top Contributor & Original Creator of Apache Spark
Conveniently located in the South of Market area, Moscone West provides easy access to downtown San Francisco’s many hotels and restaurants — providing opportunity to enjoy the city after the sessions close. Take advantage of easy transportation via BART, MUNI and CalTrain.LEARN MORE + SEE HOTEL AND AIRFARE, CAR RENTAL DEALS
Apache Spark is a powerful open-source processing engine built around speed, ease of use, and sophisticated analytics. Spark began at UC, Berkeley in 2009, and it is now developed at the vendor-independent Apache Software Foundation. Since its initial release, Spark has seen rapid adoption by enterprises across wide-ranging industries. Internet powerhouses such as Facebook, Hotels.com, Cisco, Microsoft, and Netflix have deployed Spark at massive scale, processing multiple petabytes of data on clusters of more than 8,000 nodes. Apache Spark has also become the largest open-source community in big data, with more than 1,000 contributors from over 250 organizations. Learn more
Apache Spark™ Developers
Data and ML Engineers
Infrastructure / Site Reliability Engineers
Key Decision Makers
Data and AI need to be unified: the best AI applications require massive amounts of constantly updated training data to build state-of-the-art models. So far, Apache Spark™ is the only unified analytics engine that combines large-scale data processing with state-of-the-art machine learning and AI algorithms.
Combining Spark + AI topics, this conference is a unique “one-stop shop” for developers, data scientists, and tech executives seeking to apply the best tools in data and AI to build innovative products. Join more than 7,000 engineers, data scientists, AI experts, researchers, and business professionals for three days of in-depth learning and networking.
The sessions and training at this conference will cover data engineering and data science content, along with best practices for productionizing AI: keeping training data fresh with stream processing, quality monitoring, testing, and serving models at a massive scale. The conference will also include deep-dive sessions on popular software frameworks—e.g., Delta Lake, MLflow, TensorFlow, SciKit-Learn, Keras, PyTorch, DeepLearning4J, BigDL, and deep learning pipelines.