Amirhessam Tahmassebi

Data Scientist, ExxonMobil

Amirhessam Tahmassebi is a Data Scientist at ExxonMobil and lead in design, development, and implementation of the solution for HDPE sector of Dynamic Revenue Management for ExxonMobil Chemical Company (EMCC). Amir received his PhD degree in Computational Science from Florida State University.

UPCOMING SESSIONS

PAST SESSIONS

Productionizing Machine Learning Pipelines with Databricks and Azure MLSummit 2020

Deployment of modern machine learning applications can require a significant amount of time, resources, and experience to design and implement – thus introducing overhead for small-scale machine learning projects.

In this tutorial, we present a reproducible framework for quickly jumpstarting data science projects using Databricks and Azure Machine Learning workspaces that enables easy production-ready app deployment for data scientists in particular. Although the example presented in the session focuses on deep learning, the workflow can be extended to other traditional machine learning applications as well.

The tutorial will include sample-code with templates and recommended project organization structure and tools, along with shared key learnings from our experiences in deploying machine learning pipelines into production and distributing a repeatable framework within our organization.

What you will learn:

  • Understand how to develop pipelines for continuous integration and deployment within Azure Machine Learning using Azure Databricks for an end-to-end ML application
  • Learn how to execute Apache Spark jobs using Databricks Connect and integrating source code with Azure DevOps for version control.
  • Hands-on experience building deep learning models for image classification with Tensorflow and PyTorch
  • Address challenges of the ML lifecycle by implementing MLflow for tracking model. parameters/results, packaging code for reproducibility, and deploying models.

Prerequisites:

  • Microsoft Azure Account
  • Azure Machine Learning Workspace and Azure DevOps Configured
  • Pre-Register for a Databricks Standard Trial (runtime > 6.0)
  • Docker installed and configured (Linux)
  • Basic knowledge of Python, Spark and deep learning concepts
  • Python 3.7.5 virtual environment with the following libraries
    • Poetry==1.0.5
    • Databricks-connect==6.1.*
    • azureml-core ==1.2.0
    • azureml-pipeline-core==1.2.0
    • tensorflow-cpu==2.1
    • torch==1.1.0
    • mlflow==1.8.0
    • databricks-cli==0.10.0