Solutions | Databricks Blog

Page 13

Deep Learning Tutorial Demonstrates How to Simplify Distributed Deep Learning Model Inference Using Delta Lake and Apache Spark™

November 20, 2019 by Cyrielle Simeone in Platform

On October 10th, our team hosted a live webinar— Simple Distributed Deep Learning Model Inference —with Xiangrui Meng, Software Engineer at Databricks. Model...

Scaling Hyperopt to Tune Machine Learning Models in Python

October 28, 2019 by Joseph Bradley and Max Pumperla in Solutions

Try the Hyperopt notebook to reproduce the steps outlined below and watch our on-demand webinar to learn more. Hyperopt is one of the...

Delta Lake Now Hosted by the Linux Foundation to Become the Open Standard for Data Lakes

October 15, 2019 by Michael Armbrust and Reynold Xin in Platform

Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. At today’s Spark +...

Simple, Reliable Upserts and Deletes on Delta Lake Tables using Python APIs

October 3, 2019 by Tathagata Das and Denny Lee in Solutions

We are excited to announce the release of Delta Lake 0.4.0 which introduces Python APIs for manipulating and managing data in Delta tables...

Parallelizing SAIGE Across Hundreds of Cores

October 2, 2019 by Karen Feng, Henry Davidge and Frank Austin Nothaft in Engineering

As population genetics datasets grow exponentially, it is becoming impractical to work with genetic data without leveraging Apache Spark™. There are many ways...

Diving Into Delta Lake: Schema Enforcement & Evolution

September 23, 2019 by Burak Yavuz, Brenner Heintz and Denny Lee in Company

Try this notebook series in Databricks Data, like our experiences, is always evolving and accumulating. To keep up, our mental models of the...

Doing Multivariate Time Series Forecasting with Recurrent Neural Networks

September 10, 2019 by Vedant Jain in Engineering

Try this notebook in Databricks Time Series forecasting is an important area in Machine Learning. It can be difficult to build accurate models...

Guest Blog: How Virgin Hyperloop One Reduced Processing Time from Hours to Minutes with Koalas

August 22, 2019 by Patryk Oleniuk and Sandhya Raghavan in Solutions

Watch the on-demand webinar to learn more: From pandas to Koalas: reducing Time-to-Insights for Virgin Hyperloop's Data At Virgin Hyperloop One, we work...

Diving Into Delta Lake: Unpacking The Transaction Log

August 20, 2019 by Burak Yavuz, Michael Armbrust and Brenner Heintz in Company

The transaction log is key to understanding Delta Lake because it is the common thread that runs through many of its most important...

Announcing Databricks Runtime 5.5 with Conda (Beta)

July 24, 2019 by Yifan Cao in Company

Databricks is pleased to announce the release of Databricks Runtime 5.5 with Conda (Beta). We introduced Databricks Runtime 5.4 with Conda (Beta), with...