Skip to main content
Page 1

Introducing Easier Change Data Capture in Apache Spark™ Structured Streaming

January 27, 2025 by Craig Lukasik and Anish Shrigondekar in
This blog describes the new change feed and snapshot capabilities in Apache Spark™ Structured Streaming’s State Reader API. The State Reader API enables...

Simplify Data Ingestion With the New Python Data Source API

December 10, 2024 by Craig Lukasik and Allison Wang in
Data engineering teams are frequently tasked with building bespoke ingestion solutions for myriad custom, proprietary, or industry-specific data sources. Many teams find that...

Announcing the State Reader API: The New "Statestore" Data Source

March 28, 2024 by Craig Lukasik and Jungtaek Lim in
Databricks Runtime 14.3 includes a new capability that allows users to access and analyze Structured Streaming 's internal state data: the State Reader...