Apache Spark Programming

Learn the fundamentals of Spark programming in a case study-driven course that explores the core components of the DataFrame API. During this Python/Scala based course you will:

  • Read and write data to various sources
  •  Preprocess data by correcting schemas and parsing different data types
  •  Apply multiple DataFrame transformations and actions to answer business questions.

Once completed, this course will give students the essential concepts and skills needed to navigate the Spark documentation and start programming immediately.

 

Prerequisites: 

  • No experience with Apache Spark is required
  • Basic familiarity programming in Python or Scala

 

Role: Data Engineer, ML Engineer

Duration: Full day

Labs: Yes