Databricks Certification and Badging
The new standard for lakehouse training and certifications


Databricks Certified Associate Developer for Apache Spark
The Databricks Certified Associate Developer for Apache Spark certification exam assesses the understanding of the Apache Spark architecture and components and the ability to apply the Spark DataFrame API to complete basic data manipulation tasks within a Spark session. These tasks include selecting, renaming, and manipulating columns; filtering, dropping, sorting, and aggregating rows; handling missing data; combining, reading, writing, and partitioning DataFrames with schemas; and working with UDFs and Spark SQL functions. The exam also assesses the basics of the Spark architecture, such as execution and deployment modes, the execution hierarchy, fault tolerance, garbage collection, lazy evaluation, shuffling, the use of actions, and broadcasting, along with Structured Streaming, Spark Connect, and common troubleshooting and tuning techniques. Individuals who pass this certification exam can be expected to complete basic Spark DataFrame tasks using Python.
This exam covers:
- Apache Spark Architecture and Components - 20%
- Using Spark SQL - 20%
- Developing Apache Spark™ DataFrame/Dataset API Applications - 30%
- Troubleshooting and Tuning Apache Spark DataFrame API Applications - 10%
- Structured Streaming - 10%
- Using Spark Connect to deploy applications - 5%
- Using Pandas API on Apache Spark - 5%
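As a rough planning aid, the domain weights above can be converted into approximate question counts, using the 45 questions and 90-minute time limit listed under Assessment Details. This is only an estimate under the assumption that questions are spread in proportion to the published weights; the exam may include unscored items, and the actual per-domain distribution is not stated here. The names `DOMAIN_WEIGHTS` and `expected_questions` are hypothetical and chosen for illustration.

```python
# Published domain weights (percent), copied from the list above.
DOMAIN_WEIGHTS = {
    "Apache Spark Architecture and Components": 20,
    "Using Spark SQL": 20,
    "Developing DataFrame/Dataset API Applications": 30,
    "Troubleshooting and Tuning DataFrame API Applications": 10,
    "Structured Streaming": 10,
    "Using Spark Connect to deploy applications": 5,
    "Using Pandas API on Apache Spark": 5,
}

TOTAL_QUESTIONS = 45      # from Assessment Details
TIME_LIMIT_MINUTES = 90   # from Assessment Details


def expected_questions(weights, total):
    """Estimate questions per domain, assuming a proportional split."""
    return {domain: total * pct / 100 for domain, pct in weights.items()}


# Sanity check: the published weights should cover the whole exam.
assert sum(DOMAIN_WEIGHTS.values()) == 100

estimates = expected_questions(DOMAIN_WEIGHTS, TOTAL_QUESTIONS)
for domain, count in estimates.items():
    print(f"{count:5.2f}  {domain}")

# Average time available per question.
minutes_per_question = TIME_LIMIT_MINUTES / TOTAL_QUESTIONS
print(f"About {minutes_per_question:.1f} minutes per question")
```

Fractional counts such as 13.5 simply reflect that the weights do not divide 45 evenly; treat them as a guide to where study time is best spent rather than an exact breakdown.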
Assessment Details
Type: Proctored certification
Total number of questions: 45
Time limit: 90 minutes
Registration fee: $200
Question types: Multiple choice
Test aids: None allowed
Languages: English
Delivery method: Online proctored, onsite proctored
Prerequisites: None, but related training highly recommended
Recommended experience: 6+ months of hands-on experience performing the tasks outlined in the exam guide
Validity period: 2 years
Recertification: Recertification is required every two years to maintain your certified status. To recertify, you must take the current version of the exam. Please review the “Getting Ready for the Exam” section below to prepare for your recertification exam.
Unscored content: Exams may include unscored items to gather statistical information for future use. These items are not identified on the form and do not impact your score. Additional time is factored into the exam to account for this content.
Related Training
- Instructor-led: Apache Spark™ Programming with Databricks
- Self-paced (available in Databricks Academy): Apache Spark™ Programming with Databricks
Getting Ready for the Exam
- Review the Databricks Certified Associate Developer for Apache Spark exam guide to understand what will be on the exam
- Take the related training
- Register for the exam
- Review the technical requirements and run a system check
- Review the exam guide again to identify any gaps
- Study to fill in the gaps
- Take your exam!
All code snippets in this exam will be in Python.
Registration
To register for a certification exam, please log in or create an account on our exam delivery platform.