Performance Tuning on Apache Spark

May 24, 2021 06:00 AM (PT)

Complete guided challenges as you learn to diagnose and fix poorly performing queries. Using Python/Scala, participants will review performance problems to uncover solutions and best practices to be applied to your queries.

Prerequisites: 

  • 6+ months experience working with the Spark DataFrame API is recommended
  • Intermediate programming experience in Python or Scala

 

Role: Data Engineer, ML Engineer, Data Scientist

Duration: Full day

Labs: Yes