Skip to main content
Company Blog

Today, we’re excited to announce MLflow v0.4.0, MLflow v0.4.1, and v0.4.2 which we released within the last week with some of the recently requested features. MLflow 0.4.2 is already available on PyPI and docs are updated. If you do pip install mlflow as described in the MLflow quickstart guide, you will get the recent release.

In this post, we’ll describe the new features and fixes in this release.

Azure Blob Storage Artifact Support

As part of MLflow 0.4.0, we’ve added support for storing artifacts in Azure Blob Storage, through the --default-artifact-root parameter to the mlflow server command. This makes it easy to run MLflow training jobs on multiple Azure cloud VMs and track results across them. The following example shows how to launch the tracking server with an Azure Blob Storage artifact store. You will need to set the AZURE_STORAGE_CONNECTION_STRING environment variable as noted in MLflow Tracking > Azure Blob Storage.

mlflow server --default-artifact-root wasbs://$container@$account.blob.core.windows.net/

Using MLflow with PyTorch and Tensorboard

We’ve added some samples that include advanced tracking, including a PyTorch TensorBoard Sample with the following MLflow UI and TensorBoard output.

https://www.youtube.com/watch?v=3un20Ey9iPY

https://www.youtube.com/watch?v=sxzyEqxVFS8

H2O Integration

Thanks to PR 170, MLflow now includes support for H2O model export and serving.

 

Other Features and Bug Fixes

In addition to these features, other items, bugs and documentation fixes are included in these releases. Some items worthy of note are:

  •  MLflow experiments REST API and mlflow experiments create now support providing --artifact-location (Issue #232)
  • [UI] Show GitHub links in the UI for projects run from http(s):// GitHub URLs (Issue #235)
  • Fix Spark model support when saving/loading models to/from distributed filesystems (Issue #180)
  • [Tracking] GCS artifact storage is now a pluggable dependency (no longer installed by default). To enable GCS support, install google-cloud-storage on both the client and tracking server via pip. (Issue #202)
  • [Projects] Support for running projects in subdirectories of Git repos (Issue #153)
  •  [SageMaker] Support for specifying a compute specification when deploying to SageMaker (Issue #185)

The full list of changes and contributions from the community can be found in the 0.4.2 Changelog. We welcome more input on [email protected] or by filing issues or submitting patches on GitHub. For real-time questions about MLflow, we’ve also recently created a Slack channel for MLflow as well as you can follow @MLflow on Twitter.

Watch More

Watch Introduction to MLflow with Matei Zaharia from the Seattle Spark+AI Meetup on August 2, 2018; thanks to Scott Klein for the recording.

Credits

MLflow 0.4.2 includes patches from Aaron Davidson, Andrew Chen, Arinto Murdopom, Corey Zumar, Javier Luraschi, Joel Akeret, Juntai Zheng, Matei Zaharia, Siddharth Murching, Stephanie Bodoff, Tomas Nykodym, Toon Baeyen

Try Databricks for free

Related posts

Engineering blog

7 Reasons to Learn PyTorch on Databricks

April 14, 2021 by Jules Damji in Engineering Blog
What expedites the process of learning new concepts, languages or systems? When learning a new task, do you look for analogs from skills...
Company blog

Automated Background Removal in E-commerce Fashion Image Processing Using PyTorch on Databricks

April 29, 2021 by Simona Stolnicu in Company Blog
This is a guest blog from Simona Stolnicu, a data scientist and machine learning engineer at Wehkamp, an e-commerce company, where her team...
Engineering blog

GPU-accelerated Sentiment Analysis Using Pytorch and Huggingface on Databricks

Sentiment analysis is commonly used to analyze the sentiment present within a body of text, which could range from a review, an email...
See all Company Blog posts