Unity Catalog Lakeguard: Industry-first and only data governance for multi-user Apache Spark™ clusters
April 25, 2024 by Stefania Leone, Martin Grund, Herman van Hövell, Reynold Xin and Matei Zaharia in Platform & Products & Announcements
We are thrilled to announce Unity Catalog Lakeguard, which allows you to run Apache Spark™ workloads in SQL, Python, and Scala with...

Introducing Apache Spark™ 3.5
September 15, 2023 by Yuanjian Li, Daniel Tenedorio, Martin Grund, Allan Folting, Hyukjin Kwon, Herman van Hövell, Wenchen Fan, Weichen Xu, Gengliang Wang, Allison Wang, Jungtaek Lim, Xiao Li and Reynold Xin in Engineering Blog
Today, we are happy to announce the availability of Apache Spark™ 3.5 on Databricks as part of Databricks Runtime 14.0. We extend our...

Shared Clusters in Unity Catalog for the win: Introducing Cluster Libraries, Python UDFs, Scala, Machine Learning and more
September 4, 2023 by Jakob Mund, Stefania Leone, Martin Grund, Herman van Hövell, Andrew Li and Sven Wagner-Boysen in Engineering Blog
We are thrilled to announce that you can run even more workloads on Databricks’ highly efficient multi-user clusters thanks to new security and...

Spark Connect Available in Apache Spark 3.4
April 18, 2023 by Allan Folting, Hyukjin Kwon, Xiao Li, Herman van Hövell, Stefania Leone, Martin Grund, Reynold Xin and Kris Mo in Engineering Blog
Last year Spark Connect was introduced at the Data and AI Summit. As part of the recently released Apache Spark™ 3.4, Spark Connect...

Use Databricks from anywhere with Databricks Connect “v2”
April 18, 2023 by Stefania Leone, Martin Grund, Vladislav Mantic-Lugo and Niranjan Jayakar in Platform Blog
We are thrilled to announce the public preview of Databricks Connect "v2", which enables developers to use the power of Databricks from any...

Introducing Apache Spark™ 3.4 for Databricks Runtime 13.0
April 14, 2023 by Xinrong Meng, Daniel Tenedorio, Martin Grund, Allan Folting, Hyukjin Kwon, Herman van Hövell, Wenchen Fan, Ying Xiong, Jungtaek Lim, Xiao Li and Reynold Xin in Engineering Blog
Today, we are happy to announce the availability of Apache Spark™ 3.4 on Databricks as part of Databricks Runtime 13.0. We extend...

Power to the SQL People: Introducing Python UDFs in Databricks SQL
July 22, 2022 by Martin Grund, Herman van Hövell, Stefania Leone and Jakob Mund in Platform Blog
We were thrilled to announce the preview for Python User-Defined Functions (UDFs) in Databricks SQL (DBSQL) at last month's Data and AI Summit...

Introducing Spark Connect - The Power of Apache Spark, Everywhere
July 7, 2022 by Stefania Leone, Martin Grund, Herman van Hövell and Reynold Xin in Engineering Blog
At last week's Data and AI Summit, we highlighted a new project called Spark Connect in the opening keynote. This blog post walks...