Data lakes have grown in popularity, and the Delta Lake Open Source project has gained significant momentum in parallel, by helping teams extend the flexibility and data recency of data lakes, with the increased data quality and reliability needed for downstream data science, machine learning, and business analytics. Hear about the latest developments around Delta Lake and growth in the community.
Michael Armbrust is committer and PMC member of Apache Spark and the original creator of Spark SQL. He currently leads the team at Databricks that designed and built Structured Streaming and Databricks Delta. He received his PhD from UC Berkeley in 2013, and was advised by Michael Franklin, David Patterson, and Armando Fox. His thesis focused on building systems that allow developers to rapidly build scalable interactive applications, and specifically defined the notion of scale independence. His interests broadly include distributed systems, large-scale structured storage and query optimization.