How to perform change data capture (CDC) from full table snapshots using Delta Live TablesAugust 26, 2024 by Mojgan Mazouchi and Ganesh Chand in Engineering Blog All the code is available in this GitHub repository . Prior to reading this blog we recommend reading Getting Started with Delta Live...
A Deep Dive into the Latest Performance Improvements of Stateful Pipelines in Apache Spark Structured StreamingFebruary 28, 2024 by Mojgan Mazouchi, Mrityunjay Kumar, Anish Shrigondekar and Karthikeyan Ramasamy in Engineering Blog This post is the second part of our two-part series on the latest performance improvements of stateful pipelines. The first part of this...
Performance Improvements for Stateful Pipelines in Apache Spark Structured StreamingFebruary 28, 2024 by Mojgan Mazouchi, Mrityunjay Kumar, Anish Shrigondekar and Karthikeyan Ramasamy in Engineering Blog Introduction Apache Spark™ Structured Streaming is a popular open-source stream processing platform that provides scalability and fault tolerance, built on top of the...
Build a Customer 360 Solution with Fivetran and Delta Live TablesNovember 9, 2022 by Mojgan Mazouchi, Shivam Panicker, Prasad Kona and Bilal Aslam in Engineering Blog The Databricks Lakehouse Platform is an open architecture that combines the best elements of data lakes and data warehouses. In this blog post...
Simplifying Change Data Capture With Databricks Delta Live TablesApril 25, 2022 by Mojgan Mazouchi in Engineering Blog This guide will demonstrate how you can leverage Change Data Capture in Delta Live Tables pipelines to identify new records and capture changes...