Session
Data Preparation for Machine Learning
Overview
Experience | In Person |
---|---|
Type | Paid Training |
Duration | 240 min |
In this course, you’ll learn the fundamentals of preparing data for machine learning using Databricks. We’ll cover topics like exploring, cleaning, and organizing data tailored for traditional machine learning applications. We’ll also cover data visualization, feature engineering, and optimal feature storage strategies.
Pre-requisites: Familiarity with Databricks workspace, notebooks, as well as Unity Catalog. An intermediate level knowledge of Python (scikit-learn, Matplotlib), Pandas, and PySpark. As well as with concepts of exploratory data analysis, feature engineering, standardization, and imputation methods).
Labs: Yes
Certification Path: Databricks Certified Machine Learning Associate