The pursuit of AI is one of the biggest priorities in data today. Databricks co-founders Reynold Xin and Patrick Wendell will cover the latest advancements in the Apache Spark and MLflow projects.
The keynote will also explore how data scientists are using Databricks to develop and operationalize machine learning models at scale, along with new enhancements to the Databricks platform that make the machine learning workflow even easier and more powerful.
Ali Ghodsi – Databricks
Brooke Wenig – Databricks
Clemens Mewald – Databricks
Patrick Baginski – McDonald’s
Rohan Kumar – Microsoft
Bill Nye – Guest Speaker
Ali Ghodsi is the CEO and co-founder of Databricks, responsible for the growth and international expansion of the company. He previously served as the VP of Engineering and Product Management before taking the role of CEO in January 2016. In addition to his work at Databricks, Ali serves as an adjunct professor at UC Berkeley and is on the board of UC Berkeley’s RISELab. Ali was one of the original creators of the open source project Apache Spark, and ideas from his academic research in resource management, scheduling, and data caching have been applied to Apache Mesos and Apache Hadoop. Ali received his MBA from Mid-Sweden University in 2003 and his PhD in Distributed Computing from KTH/Royal Institute of Technology in Sweden in 2006.
Brooke Wenig is a Machine Learning Practice Lead at Databricks. She leads a team of data scientists who develop large-scale machine learning pipelines for customers and teach courses on distributed machine learning best practices. She is a co-author of Learning Spark, 2nd Edition, co-instructor of the Distributed Computing with Spark SQL Coursera course, and co-host of the Data Brew podcast. She received an MS in Computer Science from UCLA with a focus on distributed machine learning. She speaks Mandarin Chinese fluently and enjoys cycling.
As the Corporate Vice President of Azure Data, Rohan Kumar is the engineering leader responsible for the product strategy, technical vision, long-range planning, design, development and implementation, and engineering process involving the certification and release of SQL Server and all Azure Data Services, including SQL DB, Cosmos DB, Database for MySQL, Database for PostgreSQL, Database for MariaDB, SQL Data Warehouse, Azure Databricks, Azure Data Lake, HDInsight, Azure Stream Analytics, Azure Data Factory, Azure Data Catalog and Microsoft’s Analytics Platform System (APS). As part of his charter, Rohan is focused on delivering core data platform services for Microsoft that allow IT professionals, DBAs, data scientists, data engineers and developers to successfully develop, deploy, and manage data applications across Azure Data Services and SQL Server workloads. Rohan joined Microsoft in July 1998 as a software development engineer in the core Windows file systems and storage team. He contributed to Windows XP and Windows Server 2003 before moving to the SQL Server team in July 2003. He has held engineering leadership roles at various levels in SQL Server since then and has contributed to SQL Server 2005, SQL Server 2008, SQL Server 2008 R2, SQL Server 2012, SQL Server 2014, SQL Azure, HDInsight Service and APS. Rohan graduated with a Bachelor of Technology degree in Computer Science and Engineering from the Indian Institute of Technology, BHU and a Master of Science degree in Computer Science from the University of Massachusetts at Amherst.
Bill Nye, a scientist, engineer, comedian, author, and inventor, is a man with a mission: to help foster a scientifically literate society and to help people everywhere understand and appreciate the science that makes our world work. Perhaps best known as Bill Nye the Science Guy, Bill garnered 18 Emmy awards with his wit and enthusiasm. Bill is also an author and frequent speaker on topics of global importance including climate change, evolution, population, space exploration, and STEM education. He is a steadfast champion of the unwavering value of critical thinking, science, and reason. Most recently, Nye was the subject of the documentary film Bill Nye: Science Guy. This behind-the-scenes portrait of Nye follows him as he takes off his Science Guy lab coat and takes on those who would deny climate change, evolution, and a science-based worldview. His 2014 debate with a creationist who believes the earth is only 6,000 years old has been viewed nearly 6.5 million times on YouTube. After that experience, Bill wrote his first book for a general audience, Undeniable: Evolution and the Science of Creation, which went on to become a New York Times bestseller. He is also the author of seven children’s books, including his most recent best-selling fiction series, Jack and the Geniuses. Nye also serves as CEO of the Planetary Society, the world’s largest space-interest group. And, as an inventor, he holds a number of unusual patents, including an improved toe shoe for ballerinas, a device to help people learn to throw a baseball better, a magnifier made of water, and an abacus that does arithmetic like the computer, with only binary numbers. Nye graduated with a mechanical engineering degree from Cornell University and returned there as a visiting professor from 2001 to 2005.
In 2010, Bill joined the ranks of his astronomy professor Carl Sagan when he was elected the American Humanist Association’s “Humanist of the Year.” Through all of his work, Bill hopes to inspire people everywhere to change the world.
Reynold is an Apache Spark PMC member and the top contributor to the project. He initiated and led efforts such as DataFrames and Project Tungsten. He is also a co-founder and Chief Architect at Databricks.
Matei Zaharia is an Assistant Professor of Computer Science at Stanford University and Chief Technologist at Databricks. He started the Apache Spark project during his PhD at UC Berkeley in 2009, and has worked broadly in datacenter systems, co-starting the Apache Mesos project and contributing as a committer on Apache Hadoop. Today, Matei tech-leads the MLflow development effort at Databricks in addition to other aspects of the platform. Matei’s research work was recognized through the 2014 ACM Doctoral Dissertation Award for the best PhD dissertation in computer science, an NSF CAREER Award, and the US Presidential Early Career Award for Scientists and Engineers (PECASE).
Clemens Mewald leads the product team for Machine Learning and Data Science at Databricks. Previously, he spent four years on the Google Brain team building AI infrastructure for Alphabet, where his product portfolio included TensorFlow and TensorFlow Extended (TFX). Clemens holds an MSc in computer science from UAS Wiener Neustadt, Austria, and an MBA from MIT Sloan.
Kasey Uhlenhuth is a product manager on the machine learning team at Databricks. Before Databricks, she worked on the Visual Studio and C# team at Microsoft building developer productivity tools. Kasey holds an MBA from the Stanford Graduate School of Business and a BA in Computer Science from Harvard University.
Patrick Baginski has over 10 years of experience in MLOps, applied machine learning, experimentation and product management. At McDonald’s, his key focus is leading the company’s data science and machine learning teams to build large-scale ML applications across various platforms and to power machine learning operations for amazing customer experiences. Prior to McDonald’s, he worked at McKinsey, providing value to clients as a data scientist and manager, and also worked at GoPro, at Salesforce, and in the mobility space.