Simple, Reliable Upserts and Deletes on Delta Lake Tables using Python APIs

October 3, 2019 by Tathagata Das and Denny Lee
We are excited to announce the release of Delta Lake 0.4.0, which introduces Python APIs for manipulating and managing data in Delta tables...

Diving Into Delta Lake: Schema Enforcement & Evolution

September 23, 2019 by Burak Yavuz, Brenner Heintz and Denny Lee
Data, like our experiences, is always evolving and accumulating. To keep up, our mental models of the...

Engineering population scale Genome-Wide Association Studies with Apache Spark™, Delta Lake, and MLflow

Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. Try this notebook series...

Guest Blog: How Virgin Hyperloop One Reduced Processing Time from Hours to Minutes with Koalas

August 22, 2019 by Patryk Oleniuk and Sandhya Raghavan
Watch the on-demand webinar to learn more: "From pandas to Koalas: reducing Time-to-Insights for Virgin Hyperloop's Data". At Virgin Hyperloop One, we work...

Diving Into Delta Lake: Unpacking The Transaction Log

The transaction log is key to understanding Delta Lake because it is the common thread that runs through many of its most important...

Announcing Databricks Runtime 5.5 and Runtime 5.5 for Machine Learning

July 16, 2019 by Yifan Cao
Databricks is pleased to announce the release of Databricks Runtime 5.5. This release includes Apache Spark 2.4.3 along with several important improvements and...

Scaling Genomic Workflows with Spark SQL BGEN and VCF Readers

June 26, 2019 by Henry Davidge
Read Rise of the Data Lakehouse to explore why lakehouses are the data architecture of the future with the father of the data...

Accurately Building Genomic Cohorts at Scale with Delta Lake and Spark SQL

June 19, 2019 by Frank Austin Nothaft and Karen Feng
This is the second...

Simplifying Streaming Stock Analysis using Delta Lake and Apache Spark: On-Demand Webinar and FAQ Now Available!

June 18, 2019 by John O'Dwyer, Navin Albert and Denny Lee
On June 13th, we...

Detecting Data Bias Using SHAP and Machine Learning

June 17, 2019 by Sean Owen
Try the Detecting Data Bias Using SHAP notebook to reproduce the steps outlined below and watch our on-demand webinar to learn more. StackOverflow's...