Skip to main content
Page 1
Industries category icon 1

Linking the unlinkables; simple, automated, scalable data linking with Databricks ARC

In April 2023 we announced the release of Databricks ARC to enable simple, automated linking of data within a single table. Today we...
Platform blog

Fleet optimization with CARTO & Databricks

Effective delivery has become increasingly important for businesses in recent years, particularly for logistics companies and those in the consumer packaged goods (CPG)...
Industries category icon 2

Improving Public Sector Decision Making With Simple, Automated Record Linking

What is data linking and why does it matter? Availability of more, high quality data is a critical enabler for better decision making...
Engineering blog

Unsupervised Outlier Detection on Databricks

Kakapo ( KAH-kə-poh ) implements a standard set of APIs for outlier detection at scale on Databricks. It provides an integration of the...
Platform blog

Best practices for cross-government data sharing

Government data exchange is the practice of sharing data between different government agencies and often partners in commercial sectors. Government can share data...
Industries category icon 3

Better Data for Better Decisions in the Public Sector Through Entity Resolution - Part 1

One of the domains where better decisions mean a better society is the Public Sector. Each and every one of us has a...
Engineering blog

Building Geospatial Data Products

January 6, 2023 by Milos Colic in Engineering Blog
Geospatial data has been driving innovation for centuries, through use of maps, cartography and more recently through digital content. For example, the oldest...
Platform blog

Security Best Practices for Delta Sharing

Update: Delta Sharing is now generally available on AWS and Azure. The data lakehouse has enabled us to consolidate our data management architectures...
Engineering blog

Designing a Java Connector for Delta Sharing Recipient

June 29, 2022 by Milos Colic and Vuong Nguyen in Engineering Blog
Making an open data marketplace Stepping into this brave new digital world we are certain that data will be a central product for...
Engineering blog

Arcuate - Machine Learning Model Exchange With Delta Sharing and MLflow

Stepping into this brave new digital world we are certain that data will be a central product for many organizations. The way to...
Platform blog

High Scale Geospatial Processing With Mosaic

Breaking through the scale barrier (discussing existing challenges) At Databricks, we are hyper-focused on supporting users along their data modernization journeys. A growing...
Engineering blog

Implementing the GDPR 'Right to be Forgotten' in Delta Lake

Databricks' Lakehouse platform empowers organizations to build scalable and resilient data platforms that allow them to drive value from their data. As the...
Engineering blog

Efficient Point in Polygon Joins via PySpark and BNG Geospatial Indexing

This is a collaborative post by Ordnance Survey, Microsoft and Databricks. We thank Charis Doidge, Senior Data Engineer, and Steve Kingston, Senior Data...
Engineering blog

Make Your RStudio on Databricks More Durable and Resilient

August 19, 2021 by Milos Colic and Robert Whiffin in Engineering Blog
One of the questions that we often hear from our customers these days is, “Should I develop my solution in Python or R?”...
Engineering blog

Improving Customer Experience With Transaction Enrichment

May 10, 2021 by Milos Colic in Engineering Blog
The retail banking landscape has dramatically changed over the past five years with the accessibility of open banking applications, mainstream adoption of Neobanks...