Introducing Window Functions in Spark SQL. July 15, 2015, by Yin Huai and Michael Armbrust in Engineering Blog. In this blog post...
Deep Dive into Spark SQL's Catalyst Optimizer. April 13, 2015, by Michael Armbrust, Yin Huai, Cheng Liang, Reynold Xin and Matei Zaharia in Engineering Blog.
What's new for Spark SQL in Apache Spark 1.3. March 24, 2015, by Michael Armbrust in Engineering Blog.
Introducing DataFrames in Apache Spark for Large Scale Data Science. February 17, 2015, by Reynold Xin, Michael Armbrust and Davies Liu in Engineering Blog. Today, we are excited to announce a new DataFrame API designed to make big data processing even easier for a wider audience. When...
Spark SQL Data Sources API: Unified Data Access for the Apache Spark Platform. January 9, 2015, by Michael Armbrust in Engineering Blog.
Spark SQL: Manipulating Structured Data Using Apache Spark. March 26, 2014, by Michael Armbrust and Reynold Xin in Engineering Blog.