Skip to main content
Page 1

Linking the unlinkables; simple, automated, scalable data linking with Databricks ARC

In April 2023 we announced the release of Databricks ARC to enable simple, automated linking of data within a single table. Today we...

Improving Public Sector Decision Making With Simple, Automated Record Linking

What is data linking and why does it matter? Availability of more, high quality data is a critical enabler for better decision making...

Best practices for cross-government data sharing

Government data exchange is the practice of sharing data between different government agencies and often partners in commercial sectors. Government can share data...

Better Data for Better Decisions in the Public Sector Through Entity Resolution - Part 1

One of the domains where better decisions mean a better society is the Public Sector. Each and every one of us has a...

Efficient Point in Polygon Joins via PySpark and BNG Geospatial Indexing

This is a collaborative post by Ordnance Survey, Microsoft and Databricks. We thank Charis Doidge, Senior Data Engineer, and Steve Kingston, Senior Data...

Make Your RStudio on Databricks More Durable and Resilient

August 19, 2021 by Milos Colic and Robert Whiffin in
One of the questions that we often hear from our customers these days is, “Should I develop my solution in Python or R?”...