Accelerating ML Experimentation in MLflow

February 10, 2021 by Andrew Nitu
This fall, I interned with the ML team, which is responsible for building the tools and services that make it easy to do...

Automatically Evolve Your Nested Column Schema, Stream From a Delta Table Version, and Check Your Constraints

We recently announced the release of Delta Lake 0.8.0, which introduces schema evolution and performance improvements in merge and operational metrics in...

How Data Lakehouses Solve Common Issues With Data Warehouses

February 4, 2021 by Ryan Boyd
Read Rise of the Data Lakehouse to explore why lakehouses are the data architecture of the future with the father of the data...

Ray & MLflow: Taking Distributed Machine Learning Applications to Production

This is a guest blog from software engineers Amog Kamsetty and Archit Kulkarni of Anyscale and contributors to Ray.io. In this blog post...

Strategies for Modernizing Investment Data Platforms

January 29, 2021 by Ricardo Portilla
The appetite for investment was at a historic high in 2020 for both individual and institutional investors. One study showed that "retail traders...

How to Accelerate Demand Planning From 4.5 Hours to Under 1 Hour With Azure Databricks

January 28, 2021 by Adam Wasserman and Clinton Ford
The importance of supply chain analytics: Rapid changes in consumer purchase behavior can have a material impact on supply chain planning, inventory management...

Burning Through Electronic Health Records in Real Time With Smolder

Check out the solution accelerator to download the notebook referred to throughout this blog. In previous blogs, we looked at two separate workflows...

Combining Rules-based and AI Models to Combat Financial Fraud

The financial services industry (FSI) is rushing towards transformational change, delivering transactional features and facilitating payments through new digital channels to remain competitive...

Leveling the Playing Field: HorovodRunner for Distributed Deep Learning Training

January 14, 2021 by Jing Pan and Wendao Liu
This is a guest post authored by Sr. Staff Data Scientist/User Experience Researcher Jing Pan and Senior Data Scientist Wendao Liu of leading...

Bayesian Modeling of the Temporal Dynamics of COVID-19 Using PyMC3

In this post, we look at how to use PyMC3 to infer the disease parameters for COVID-19. PyMC3 is a popular probabilistic programming...