Training Highly Scalable Deep Recommender Systems on Databricks (Part 1)

Recommender systems (RecSys) have become an integral part of modern digital experiences, powering personalized content suggestions across various platforms. These sophisticated systems and...

Adaptive Query Execution in Structured Streaming

In Databricks Runtime, Adaptive Query Execution (AQE) is a performance feature that continuously re-optimizes batch queries using runtime statistics during query execution. Starting...

Latency goes subsecond in Apache Spark Structured Streaming

Apache Spark Structured Streaming is the leading open source stream processing platform. It is also the core technology that powers streaming on the...

How Collective Health uses Delta Live Tables and Structured Streaming for Data Integration

April 13, 2023 by Mragesh Khandelwal and Mahmoud Saleh
Collective Health is not an insurance company. We're a technology company that's fundamentally making health insurance work better for everyone— starting with the...

Scalable Spark Structured Streaming for REST API Destinations

March 1, 2023 by Art Rask and Jay Palaniappan
Spark Structured Streaming is the widely used open source engine at the foundation of data streaming on the Databricks Lakehouse Platform. It can...

Build Reliable and Cost Effective Streaming Data Pipelines With Delta Live Tables’ Enhanced Autoscaling

This year we announced the general availability of Delta Live Tables (DLT), the first ETL framework to use a simple, declarative approach...

Python Arbitrary Stateful Processing in Structured Streaming

October 17, 2022 by Hyukjin Kwon and Jungtaek Lim
More and more customers are using Databricks for their real-time analytics and machine learning workloads to meet the ever-increasing demand of their...

State Rebalancing in Structured Streaming

In light of the accelerated growth and adoption of Apache Spark Structured Streaming, Databricks announced Project Lightspeed at Data + AI Summit 2022...

Using Streaming Delta Live Tables and AWS DMS for Change Data Capture From MySQL

September 29, 2022 by Neil Patel
In this article we will walk you through the steps to create an end-to-end CDC pipeline with Terraform using Delta Live Tables, AWS...

Databricks at Current 2022

Current 2022, organized by Confluent, is the first-ever data streaming industry event – and it's coming up soon! No matter where you...