What’s New in Apache Spark 3.0

This course covers the new features and changes introduced to Apache Spark and the surrounding ecosystem during the past 12 months. It focuses on Spark 2.4 and 3.0, covering updates to performance, monitoring, usability, stability, extensibility, PySpark, SparkR, Delta Lake, Pandas, and MLflow. Students will also learn about backwards compatibility with 2.x and the considerations required for upgrading to Spark 3.0. This is a follow-along course with no hands-on exercises.

Requirements – Familiarity with Apache Spark 2.x


 