
Simplifying Data + AI, One Line of TypeScript at a Time

October 21, 2021 by Reynold Xin and Matei Zaharia
Today, Databricks is known for our backend engineering, building and operating cloud systems that span millions of virtual machines processing exabytes of data...

Introducing SQL User-Defined Functions

October 20, 2021 by Serge Rielau and Allison Wang
A user-defined function (UDF) is a means for a user to extend the native capabilities of Apache Spark™ SQL. SQL on Databricks has...

Introducing Apache Spark™ 3.2

We are excited to announce the availability of Apache Spark™ 3.2 on Databricks as part of Databricks Runtime 10.0. We want to...

MLflow for Bayesian Experiment Tracking

October 18, 2021 by Srijith Rajamohan, Ph.D.
This post is the third in a series on Bayesian inference ([1], [2]). Here we will illustrate how to use...

Creating an IP Lookup Table of Activities in a SIEM Architecture

October 18, 2021 by Sepideh Ebrahimi and Andy Hutchinson
When working with cyber security data, one thing is for sure: there is no shortage of available data sources. If anything, there are...

Developing Databricks' Runbot CI Solution

October 14, 2021 by Li Haoyi
Runbot is a bespoke continuous integration (CI) solution developed specifically for Databricks' needs. Originally developed in 2019, Runbot incrementally replaces our aging Jenkins...

Native Support of Session Window in Spark Structured Streaming

October 12, 2021 by Jungtaek Lim, Yuanjian Li and Shixiong Zhu
Apache Spark™ Structured Streaming allowed users to do aggregations on windows over event-time. Before Apache Spark 3.2™, Spark supported tumbling windows and...

Efficient Point in Polygon Joins via PySpark and BNG Geospatial Indexing

This is a collaborative post by Ordnance Survey, Microsoft and Databricks. We thank Charis Doidge, Senior Data Engineer, and Steve Kingston, Senior Data...

5 Steps to Get Started With Databricks on Google Cloud

October 8, 2021 by Hiral Jasani and Dhruv Kumar
Since we launched Databricks on Google Cloud earlier this year, we’ve been thrilled to see stories about the value this joint solution has...

Databricks Repos Is Now Generally Available - New ‘Files’ Feature in Public Preview

October 7, 2021 by Ka-Hing Cheung and Vaibhav Sethi
Thousands of Databricks customers have adopted Databricks Repos since its public preview and have standardized on it for their development and production workflows...