With the community working on preparing the next versions of Apache Spark you may be asking yourself ‘how do I get involved in contributing to this?’ With such a large volume of contributions, it can be hard to know how to begin contributing yourself. Holden Karau offers a developer-focused head start, walking you through how to find good issues, formatting code, finding reviewers, and what to expect in the code review process. In addition to looking at how to contribute code we explore some of the other ways you can contribute to to Apache Spark from helping test release candidates, to doing the all important code reviews, bug triage, and many more (like answering questions).
Holden is an Apache Spark committer and PMC member who focus on PySpark and Kubernetes support. She is the co-author of Learning Spark, High Performance Spark, and another Spark book that’s a bit more out of date. She was tricked into the world of big data while trying to improve search and recommendation systems and has long since forgotten her original goal. Her current side project is working on a book to teach children distributed systems, http://www.distributedcomputing4kids.com/.