Skip to main content
Page 1
Engineering blog

Burning Through Electronic Health Records in Real Time With Smolder

Check out the solution accelerator to download the notebook referred throughout this blog. In previous blogs , we looked at two separate workflows...
Engineering blog

Detecting At-risk Patients with Real World Data

With the rise of low cost genome sequencing and AI-enabled medical imaging, there has been substantial interest in precision medicine. In precision medicine...
Company blog

Introducing GlowGR: An industrial-scale, ultra-fast and sensitive method for genetic association studies

Today, we announce that we are making a new whole genome regression method available to the open source bioinformatics community as part of...
Engineering blog

Building a Modern Clinical Health Data Lake with Delta Lake

The healthcare industry is one of the biggest producers of data. In fact, the average healthcare organization is sitting on nearly 9 petabytes...
Engineering blog

Automating Digital Pathology Image Analysis with Machine Learning on Databricks

Check out our solution accelerator for automating digital pathology analysis or watch our on-demand webinar to learn more. With technological advancements in imaging...
Platform blog

Introducing Glow: An Open-Source Toolkit for Large-Scale Genomic Analysis

The key to solving some of today’s most challenging medical problems lies in the analysis of genomics data. Understanding the impact of the...
Engineering blog

Parallelizing SAIGE Across Hundreds of Cores

As population genetics datasets grow exponentially, it is becoming impractical to work with genetic data without leveraging Apache Spark™. There are many ways...
Company blog

Engineering population scale Genome-Wide Association Studies with Apache Spark™, Delta Lake, and MLflow

Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. Try this notebook series...
Company blog

Monitor Medical Device Data with Machine Learning using Delta Lake, Keras and MLflow: On-Demand Webinar and FAQs now available!

September 12, 2019 by Michael Ortega and Frank Austin Nothaft in Company Blog
On August 20th, our team hosted a live webinar— Automated Monitoring of Medical Device Data with Data Science —with Frank Austin Nothaft, PhD...
Engineering blog

Accurately Building Genomic Cohorts at Scale with Delta Lake and Spark SQL

Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. This is the second...
Engineering blog

Simplifying Genomics Pipelines at Scale with Databricks Delta

Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. Try this notebook in...
Engineering blog

Building the Fastest DNASeq Pipeline at Scale

In June, we announced the Unified Analytics Platform for Genomics with a simple goal: accelerate discovery with a collaborative platform for interactive genomic...
Company blog

Accelerating Discovery with a Unified Analytics Platform for Genomics

Today we are proud to introduce the Databricks Unified Analytics Platform for Genomics. With a unified platform for genomic data processing, tertiary analytics...