Get Started with Databricks Platform Administration

In this course, you will learn the basics of platform administration on the Databricks Data Intelligence Platform. It offers a comprehensive overview of Unity Catalog, a vital component for effective data governance within Databricks environments. Divided into five modules, it begins with a detailed introduction to Databricks infrastructure and the Data Intelligence Platform, including an in-depth walkthrough of the Databricks Workspace. You will explore data governance principles within Unity Catalog, covering its key concepts, architecture, and roles. The course then covers managing Unity Catalog metastores and compute resources, including clusters and SQL warehouses. Finally, you will learn data access control: privileges, fine-grained access, and how to govern data objects. By the end, you will have the essential skills to administer Unity Catalog in order to implement effective data governance, optimize compute resources, and enforce robust data security strategies.

Skill Level: Onboarding
Duration: 2h
Prerequisites

The content was developed for participants with these skills/knowledge/abilities:

  • Basic knowledge of cloud computing and SQL concepts such as networking basics, SQL commands, aggregate functions, filters and sorting, indexes, tables, and views.
  • Basic knowledge of Python programming, the Jupyter notebook interface, and PySpark fundamentals.

Outline

Databricks Overview

  • Databricks Infrastructure
  • Databricks Data Intelligence Platform
  • Unity Catalog Overview
  • Databricks Workspace Walkthrough

Databricks Platform Administration

  • Data Governance in Unity Catalog
  • Managing Principals in Unity Catalog
  • Managing Unity Catalog Metastores
  • Compute Resources and Unity Catalog
  • Data Access Control in Unity Catalog
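As a rough illustration of the "Data Access Control in Unity Catalog" topic, the sketch below builds the SQL GRANT statements a principal typically needs to read a single table: USE CATALOG on the parent catalog, USE SCHEMA on the parent schema, and SELECT on the table itself. This is not course material. The helper name read_grants and the catalog, schema, table, and group names are hypothetical, and the code only generates the statement text without running anything against a workspace.

```python
from typing import List


def read_grants(catalog: str, schema: str, table: str, principal: str) -> List[str]:
    """Build the GRANT statements needed for read access to one table.

    Hypothetical helper for illustration: it returns SQL text only and
    does not connect to Databricks.
    """
    quoted = f"`{principal}`"  # backticks allow names with dashes or spaces
    return [
        f"GRANT USE CATALOG ON CATALOG {catalog} TO {quoted}",
        f"GRANT USE SCHEMA ON SCHEMA {catalog}.{schema} TO {quoted}",
        f"GRANT SELECT ON TABLE {catalog}.{schema}.{table} TO {quoted}",
    ]


if __name__ == "__main__":
    # All object and group names below are made up for the example.
    for statement in read_grants("main", "sales", "orders", "data-analysts"):
        print(statement + ";")
```

The statements are ordered from the outermost securable (catalog) to the innermost (table), mirroring Unity Catalog's three-level namespace.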

Upcoming Public Classes

Date | Time | Language | Price
Apr 21 | 09 AM - 11 AM (Asia/Tokyo) | Japanese | Free
Apr 28 | 02 PM - 04 PM (Asia/Kolkata) | English | Free
Apr 28 | 09 AM - 11 AM (America/Los_Angeles) | English | Free
Apr 30 | 12 PM - 02 PM (Asia/Singapore) | English | Free
May 05 | 09 AM - 11 AM (America/Los_Angeles) | English | Free
May 06 | 02 PM - 04 PM (Asia/Kolkata) | English | Free
May 12 | 12 PM - 02 PM (Asia/Singapore) | English | Free
May 22 | 02 PM - 04 PM (Asia/Kolkata) | English | Free
May 29 | 03 PM - 05 PM (Europe/London) | English | Free
Jun 03 | 02 PM - 04 PM (Asia/Kolkata) | English | Free
Jun 06 | 09 AM - 11 AM (America/Los_Angeles) | English | Free
Jun 20 | 12 PM - 02 PM (Asia/Singapore) | English | Free
Jun 23 | 09 AM - 11 AM (Asia/Tokyo) | Japanese | Free
Jun 26 | 03 PM - 05 PM (Europe/London) | English | Free
Jun 30 | 02 PM - 04 PM (Asia/Kolkata) | English | Free

Public Class Registration

If your company has purchased success credits or has a learning subscription, please fill out the Training Request form. Otherwise, you can register below.

Private Class Request

If your company is interested in private training, please submit a request.

Registration options

Databricks has a delivery method for wherever you are on your learning journey.

Self-Paced

Custom-fit learning paths for data, analytics, and AI roles, delivered through on-demand videos.

Instructor-Led

Public and private courses taught by expert instructors, ranging in length from half a day to two days.

Blended Learning

Self-paced and weekly instructor-led sessions for every style of learner, designed to improve course completion and knowledge retention. Go to the Subscriptions Catalog tab to purchase.

Skills@Scale

A comprehensive training offering for large-scale customers that includes learning elements for every style of learning. Contact your account executive for details.

Upcoming Public Classes

Data Engineer

Automated Deployment with Databricks Asset Bundles

This course provides a comprehensive review of DevOps principles and their application to Databricks projects. It begins with an overview of core DevOps, DataOps, continuous integration (CI), continuous deployment (CD), and testing, and explores how these principles can be applied to data engineering pipelines.

The course then focuses on continuous deployment within the CI/CD process, examining tools like the Databricks REST API, SDK, and CLI for project deployment. You will learn about Databricks Asset Bundles (DABs) and how they fit into the CI/CD process. You’ll dive into their key components, folder structure, and how they streamline deployment across various target environments in Databricks. You will also learn how to add variables, modify, validate, deploy, and execute Databricks Asset Bundles for multiple environments with different configurations using the Databricks CLI.
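To make the validate, deploy, and execute sequence concrete, here is a minimal sketch that assembles the corresponding Databricks CLI invocations for several targets. It is an illustration under assumptions, not the course's own material: the helper bundle_commands, the target names dev and prod, and the job key my_job are hypothetical. The script only prints the commands; it does not invoke the CLI.

```python
import shlex
from typing import List


def bundle_commands(target: str, job_key: str) -> List[List[str]]:
    """Return the CLI invocations for one target: validate, deploy, then run.

    Hypothetical helper for illustration; it builds argument lists only.
    """
    return [
        ["databricks", "bundle", "validate", "--target", target],
        ["databricks", "bundle", "deploy", "--target", target],
        ["databricks", "bundle", "run", "--target", target, job_key],
    ]


if __name__ == "__main__":
    # "dev", "prod", and "my_job" are placeholder names; real targets and
    # resource keys come from the bundle's own configuration file.
    for target in ("dev", "prod"):
        for command in bundle_commands(target, "my_job"):
            print(shlex.join(command))
```

Keeping the target as a parameter reflects the idea described above: the same bundle definition is deployed to different environments, with per-target configuration selected at the command line.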

Finally, the course introduces Visual Studio Code as an Integrated Development Environment (IDE) for building, testing, and deploying Databricks Asset Bundles locally, streamlining your development process. The course concludes with an introduction to automating deployment pipelines using GitHub Actions to enhance the CI/CD workflow with Databricks Asset Bundles.

By the end of this course, you will be equipped to automate Databricks project deployments with Databricks Asset Bundles, improving efficiency through DevOps practices.

Note: This course is the fourth in the 'Advanced Data Engineering with Databricks' series.

Free
2h
Professional

Questions?

If you have any questions, please refer to our Frequently Asked Questions page.