Apache Spark™ Programming with Databricks
In this course, you will explore the fundamentals of Apache Spark and Delta Lake on Databricks. You will learn the architectural components of Spark, the DataFrame and Structured Streaming APIs, and how Delta Lake can improve your data pipelines. Lastly, you will execute streaming queries to process streaming data and understand the advantages of using Delta Lake.
- Familiarity with Python and basic programming concepts, including data types, lists, dictionaries, variables, functions, loops, conditional statements, exception handling, accessing classes, and using third-party libraries
- Basic knowledge of SQL, including writing queries using SELECT, WHERE, GROUP BY, ORDER BY, LIMIT, and JOIN
Outline
Day 1
- Spark overview
- Databricks platform overview
- SparkSQL
- DataFrame reader, writer, transformation, and aggregation
- Datetimes
- Complex types
Day 2
- User-defined functions (UDFs) and vectorized UDFs
- Spark internals
- Query optimization
- Partitioning
- Streaming API
- Delta Lake
Upcoming Public Classes
Date | Time | Language | Price |
---|---|---|---|
Nov 25 - 26 | 09 AM - 05 PM (America/New_York) | English | $1500.00 |
Dec 05 - 06 | 10 AM - 06 PM (America/New_York) | English | $1500.00 |
Dec 09 - 10 | 09 AM - 05 PM (Europe/London) | English | $1500.00 |
Dec 10 - 13 | 01 PM - 05 PM (Australia/Sydney) | English | $1500.00 |
Dec 17 - 20 | 10 AM - 02 PM (Europe/Paris) | English | $1500.00 |
Dec 19 - 20 | 09 AM - 05 PM (America/New_York) | English | $1500.00 |
Jan 06 - 07 | 09 AM - 05 PM (America/New_York) | English | $1500.00 |
Jan 07 - 10 | 08 AM - 12 PM (Asia/Kolkata) | English | $1500.00 |
Jan 13 - 14 | 09 AM - 05 PM (Europe/Paris) | English | $1500.00 |
Jan 22 - 23 | 09 AM - 05 PM (America/Chicago) | English | $1500.00 |
Jan 30 - 31 | 09 AM - 05 PM (Europe/Paris) | English | $1500.00 |
Public Class Registration
If your company has purchased success credits or has a learning subscription, please fill out the Training Request form. Otherwise, you can register below.
Private Class Request
If your company is interested in private training, please submit a request.
Registration options
Databricks has a delivery method for wherever you are on your learning journey
Self-Paced
Custom-fit learning paths for data, analytics, and AI roles and career paths through on-demand videos
Register nowInstructor-Led
Public and private courses taught by expert instructors across half-day to two-day courses
Register nowBlended Learning
Self-paced and weekly instructor-led sessions for every style of learner to optimize course completion and knowledge retention. Go to Subscriptions Catalog tab to purchase
Purchase nowSkills@Scale
Comprehensive training offering for large scale customers that includes learning elements for every style of learning. Inquire with your account executive for details