Introducing Apache Spark™ 3.5 | September 15, 2023 by Yuanjian Li, Daniel Tenedorio, Martin Grund, Allan Folting, Hyukjin Kwon, Herman van Hövell, Wenchen Fan, Weichen Xu, Gengliang Wang, Allison Wang, Jungtaek Lim, Xiao Li and Reynold Xin in Engineering Blog | Today, we are happy to announce the availability of Apache Spark™ 3.5 on Databricks as part of Databricks Runtime 14.0. We extend our...
Spark Connect Available in Apache Spark 3.4 | April 18, 2023 by Allan Folting, Hyukjin Kwon, Xiao Li, Herman van Hövell, Stefania Leone, Martin Grund, Reynold Xin and Kris Mo in Engineering Blog | Last year Spark Connect was introduced at the Data and AI Summit. As part of the recently released Apache Spark™ 3.4, Spark Connect...
Introducing Apache Spark™ 3.4 for Databricks Runtime 13.0 | April 14, 2023 by Xinrong Meng, Daniel Tenedorio, Martin Grund, Allan Folting, Hyukjin Kwon, Herman van Hövell, Wenchen Fan, Ying Xiong, Jungtaek Lim, Xiao Li and Reynold Xin in Engineering Blog | Today, we are happy to announce the availability of Apache Spark™ 3.4 on Databricks as part of Databricks Runtime 13.0. We extend...
Introducing Apache Spark™ 3.3 for Databricks Runtime 11.0 | June 15, 2022 by Maxim Gekk, Wenchen Fan, Hyukjin Kwon, Serge Rielau, Yingyi Bu, Xiao Li and Reynold Xin in Engineering Blog | Today we are happy to announce the availability of Apache Spark™ 3.3 on Databricks as part of Databricks Runtime 11.0. We want...
Introducing Apache Spark™ 3.2 | October 19, 2021 by Gengliang Wang, Wenchen Fan, Hyukjin Kwon, Xiao Li and Reynold Xin in Engineering Blog | We are excited to announce the availability of Apache Spark™ 3.2 on Databricks as part of Databricks Runtime 10.0. We want to...
Introducing Apache Spark™ 3.1 | March 2, 2021 by Hyukjin Kwon, Wenchen Fan, Xiao Li and Reynold Xin in Engineering Blog | We are excited to announce the availability of Apache Spark 3.1 on Databricks as part of Databricks Runtime 8.0. We want to...
Improving the Spark Exclusion Mechanism in Databricks | November 6, 2020 by Tianhan Hu, Xingbo Jiang and Xiao Li in Engineering Blog | Ed Note: This article contains references to the term blacklist, a term that the Spark community is actively working to remove from Spark...
Interoperability between Koalas and Apache Spark | August 11, 2020 by Takuya Ueshin, Hyukjin Kwon and Xiao Li in Solutions | Koalas is an open source project which provides a drop-in replacement for pandas, enabling efficient scaling out to hundreds of worker nodes for...
Introducing Koalas 1.0 | June 24, 2020 by Hyukjin Kwon, Takuya Ueshin and Xiao Li in Product | Koalas was first introduced last year to provide data scientists using pandas with a way to scale their existing big data workloads by...
Introducing Apache Spark 3.0 | June 18, 2020 by Matei Zaharia, Reynold Xin, Xiao Li, Wenchen Fan and Yin Huai in Product | We’re excited to announce that the Apache Spark™ 3.0.0 release is available on Databricks as part of our new Databricks Runtime 7.0...