Engineering blog

Data Exfiltration Protection with Azure Databricks

In the previous blog, we discussed how to securely access Azure Data Services from Azure Databricks using Virtual Network Service Endpoints or...
Engineering blog

Implementing LLM Guardrails for Safe and Responsible Generative AI Deployment on Databricks

Let’s explore a common scenario – your team is eager to leverage open source LLMs to build chatbots for customer support interactions...
Engineering blog

Announcing the General Availability of Databricks Feature Serving

Today, we are excited to announce the general availability of Feature Serving. Features play a pivotal role in AI Applications, typically requiring considerable...
Platform blog

Databricks Expands Brickbuilder Program to Include Unity Catalog Accelerators

March 7, 2024 by Christine Gauthier in Partners
Today, we're excited to announce the launch of Brickbuilder Unity Catalog Accelerators. This is an expansion to the Brickbuilder Accelerator program, which...
Engineering blog

Simplify PySpark testing with DataFrame equality functions

The DataFrame equality test functions were introduced in Apache Spark™ 3.5 and Databricks Runtime 14.2 to simplify PySpark unit testing. The full set...
Engineering blog

A Deep Dive into the Latest Performance Improvements of Stateful Pipelines in Apache Spark Structured Streaming

This post is the second part of our two-part series on the latest performance improvements of stateful pipelines. The first part of this...
Engineering blog

Performance Improvements for Stateful Pipelines in Apache Spark Structured Streaming

Apache Spark™ Structured Streaming is a popular open-source stream processing platform that provides scalability and fault tolerance, built on top of the...
Platform blog

Databricks adds new migration Brickbuilder Solutions to help customers succeed with AI

February 15, 2024 by Christine Gauthier in Partners
For the past two years, Databricks has collaborated with leading consulting partners to build innovative solutions for industry, migration, and data and AI...
Engineering blog

Announcing Ray Autoscaling support on Databricks and Apache Spark™

Ray is an open-source unified compute framework that simplifies scaling AI and Python workloads in a distributed environment. Since we introduced support for...
Engineering blog

Parameterized queries with PySpark

PySpark has always provided wonderful SQL and Python APIs for querying data. As of Databricks Runtime 12.1 and Apache Spark 3.4, parameterized queries...