Developing Databricks' Runbot CI Solution

October 14, 2021 by Li Haoyi
Runbot is a bespoke continuous integration (CI) solution developed specifically for Databricks' needs. Originally developed in 2019, Runbot incrementally replaces our aging Jenkins...

How YipitData Extracts Insights From Alternative Data Using Delta Lake

September 21, 2021 by Anup Segu and Bobby Muldoon
Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. This is a guest...

Real-time Point-of-Sale Analytics With a Data Lakehouse

September 9, 2021 by Bryan Smith and Rob Saker
Disruptions in the supply chain – from reduced product supply and diminished warehouse capacity – coupled with rapidly shifting consumer expectations for seamless...

How Incremental ETL Makes Life Simpler With Data Lakes

August 30, 2021 by John O'Dwyer
Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. Incremental ETL (Extract, Transform...

Improving On-Shelf Availability for Items With AI Out of Stock Modeling

This post was written in collaboration with Databricks partner Tredence. We thank Rich Williams, Vice President Data Engineering, and Morgan Seybert, Chief Business...

Solution Accelerator: Multi-touch Attribution

August 23, 2021 by Debu Sinha and Dan Morris
Behind the growth of every consumer-facing product is the acquisition and retention of an engaged user base. When it comes to customer acquisition...

Getting Started With Ingestion into Delta Lake

July 23, 2021 by John O'Dwyer and Nancy Shah
Ingesting data can be hard and complex since you either need to use an always-running streaming platform like Kafka or you need to...

How to Build a Scalable Wide and Deep Product Recommender

Download the notebooks referenced throughout this article. I have a favorite coffee shop I've been visiting for years. When I walk in, the...

How to Simplify CDC With Delta Lake's Change Data Feed

Change data capture (CDC) is a use case that we see many customers implement in Databricks – you...

Machine Learning-based Item Matching for Retailers and Brands

Item matching is a core function in online marketplaces. To ensure an optimized customer experience, retailers compare new and updated product information against...