Keynotes: Lakehouse Data Architecture, Data Engineering, and Analytics

Wednesday, May 26, 08:00 AM (PT)

Hear from Databricks co-founders and the original creators of popular projects Apache Spark, Delta Lake and MLflow on how the open source community is tackling the biggest challenges in data.

They’ll also reveal some of the latest innovations in data engineering and data analytics to simplify and scale your work. We’ll also be joined by data leaders from Atlassian and Microsoft, as well as Nobel laureate Malala Yousafzai, an inspiring human rights advocate.

Future is Open. Lakehouse is Here | Ali Ghodsi | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks

Databricks CEO Ali Ghodsi kicks off Summit, live from the Lakehouse. Ali talks about momentum in open source data technologies and the growing adoption of the data lakehouse architecture, combining the best of data warehouses and data lakes. The lakehouse is one platform to unify all your data, analytics and AI workloads.

Ali Ghodsi and Bill Inmon | Fireside Chat | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks  •  Bill Inmon, Computer scientist, author, and technology pioneer, best known as the Father of Data Warehousing

Databricks CEO Ali Ghodsi interviews Bill Inmon, the “Father of the Data Warehouse,” about the industry’s evolution to the lakehouse architecture. Bill discusses the need for an open lakehouse architecture built on top of data lakes that natively supports data warehousing and machine learning. Bill says enterprises that don’t build a lakehouse will have a mountain of data that goes to waste. The lakehouse will unlock the data and present opportunities we’ve never seen before.

Announcing Delta Lake 1.0 | Michael Armbrust | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks  •  Michael Armbrust, Distinguished Engineer, Databricks

Delta Lake co-creator and Databricks Distinguished Engineer Michael Armbrust announces the Delta Lake 1.0 milestone and key features, including generated columns, querying data federated across multiple clouds, standalone Delta Lake in Python and more. He also introduces a set of new open source committers.

Building the lakehouse at Atlassian | Rohan Dhupelia | Keynote Data + AI Summit NA 2021

Michael Armbrust, Distinguished Engineer, Databricks  •  Rohan Dhupelia, Data Platform Senior Manager, Atlassian

Rohan Dhupelia of Atlassian talks about the evolution of their internal data architecture to the lakehouse as the “sweet spot” between the data warehouse and the data lake.

Announcing Delta Sharing with Demo | Matei Zaharia | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks  •  Matei Zaharia, Assistant Professor of Computer Science; Original Creator of Apache Spark & MLflow, Databricks

“Data needs to flow beyond the borders of individual organizations,” says Databricks CEO Ali Ghodsi. He announces Delta Sharing, the industry’s first open protocol for secure data sharing, as open source under the Linux Foundation. Databricks Chief Technologist Matei Zaharia dives into the goals and the details of being a data provider or a data recipient.

Ali Ghodsi and Matt Garman (SVP, AWS) | Fireside Chat | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks  •  Matt Garman, Senior Vice President, AWS WW Sales and Marketing, AWS

Matt Garman, SVP at AWS, talks about some of the early lessons of scale at Amazon Web Services. Matt also addresses the trends around lakehouse adoption that AWS has observed, and the advantages that Delta Sharing can bring to data sharing.

Announcing Delta Live Tables with Demo | Michael Armbrust | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks  •  Michael Armbrust, Distinguished Engineer, Databricks

Distinguished Engineer Michael Armbrust announces Delta Live Tables, making it possible to do production-quality ETL using only SQL queries. The Delta Live Tables runtime takes care of operational, governance and quality concerns, allowing you to spend more time getting value from the data. You can even mix Python with SQL to do advanced analytics and AI. Learn more at databricks.com.

Announcing the Unity Catalog | Matei Zaharia | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks  •  Matei Zaharia, Assistant Professor of Computer Science; Original Creator of Apache Spark & MLflow, Databricks

Databricks CEO Ali Ghodsi announces the Unity Catalog, the industry’s first unified catalog for the lakehouse. It allows organizations to standardize on one security model based on ANSI SQL. Chief Technologist Matei Zaharia then dives into the governance challenges solved by the Unity Catalog.

SQL Analytics & Photon Updates with Demo | Reynold Xin | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks  •  Reynold Xin, Co-founder & Chief Architect, Databricks

Databricks Chief Architect Reynold Xin talks about the performance improvements and simplified administration now available in Photon and SQL Analytics. Get a first-class SQL development experience backed by an engine with improved concurrent querying capabilities.

Ali Ghodsi and Rohan Kumar (CVP, Microsoft) | Fireside Chat | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks  •  Rohan Kumar, Corporate Vice President, Azure Data, Microsoft

Rohan Kumar, CVP of Azure Data at Microsoft, shares what customers like Grab and ABN AMRO are able to achieve with data and AI using Azure Databricks, and how innovations like Photon and Delta Sharing are coming to life on Azure.

Malala Yousafzai and Ali Ghodsi | Fireside Chat | Keynote Data + AI Summit NA 2021

Ali Ghodsi, Co-founder & CEO Original Creator of Apache Spark, Databricks  •  Malala Yousafzai, Co-Founder of Malala Fund and Nobel Laureate

Malala, an internationally recognized activist, joins to share her work to give every girl around the globe access to high-quality education.

Ali Ghodsi

Co-founder and CEO at Databricks

In addition to leading Databricks, Ali is an original creator of Apache Spark and an Adjunct Professor at the University of California, Berkeley. Ali will lead the morning keynotes, opening with the opportunities that recent innovations create for simplifying and scaling data.

Bill Inmon

Father of Data Warehousing

Computer scientist, author and technology pioneer Bill Inmon will join a fireside chat with Databricks Co-founder and CEO Ali Ghodsi. They’ll talk about the evolution of data infrastructure – from data warehouses to data lakes to data lakehouses – and give a preview of Bill’s upcoming book.

Michael Armbrust

Original Creator of Spark SQL and PMC Member

Michael Armbrust, a Distinguished Engineer and leader of the Delta Lake and Streaming efforts at Databricks, will review the momentum of the Delta Lake open source project and share some of the latest initiatives the team has been working on.

Rohan Dhupelia

Leader of Analytics Platform at Atlassian

Atlassian, maker of the popular Jira, Trello and Bitbucket products, made the journey to the lakehouse to enable data democratization at scale. Rohan will discuss the cost and challenges of their data warehouse origins, including data duplication, data latency and concurrency issues. Rohan will also dive into the benefits of their latest data lakehouse, which reduced cost, simplified access and governance, and increased the pace of innovation with greater autonomy for teams.

Matei Zaharia

Original creator of MLflow and Apache Spark

Matei, Co-founder and Chief Technologist of Databricks and an Assistant Professor at Stanford University, will talk about the latest features in both open source and the Databricks Lakehouse Platform.

Matt Garman

SVP of Amazon Web Services

Matt will discuss his experience launching Amazon EC2 and chat with Databricks Co-founder and CEO Ali Ghodsi about their perspectives on the emergence of the lakehouse. They’ll wrap up the conversation with insights into the new keynote announcements.

Reynold Xin

Top contributor to Apache Spark

Reynold, Co-founder and Chief Architect at Databricks, will share updates in open source and Databricks that improve performance and scale of SQL analytics using a lakehouse architecture.

Rohan Kumar

Corporate Vice President, Azure Data, Microsoft

As the Corporate Vice President of Azure Data, Rohan is the engineering leader responsible for the product strategy, technical vision, long-range planning, design, development and implementation, and engineering process involving the certification and release of SQL Server and all Azure Data Services, including SQL DB, Cosmos DB, Database for MySQL, Database for PostgreSQL, Database for MariaDB, SQL Data Warehouse, Azure Databricks, Azure Data Lake, HDInsight, Azure Stream Analytics, Azure Data Factory, Azure Data Catalog and Microsoft’s Analytics Platform System (APS).

Malala Yousafzai

Nobel Laureate and Co-founder of Malala Fund

Malala, an internationally recognized activist, joins to share her work to give every girl around the globe access to high-quality education.