Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark - Databricks

Accelerating Shuffle: A Tailor-Made RDMA Solution for Apache Spark

Download Slides

The opportunity in accelerating Spark by improving its network data transfer facilities has been under much debate in the last few years. RDMA (remote direct memory access) is a network acceleration technology that is very prominent in the HPC (high-performance computing) world, but has not yet made its way to mainstream Apache Spark. Proper implementation of RDMA in network-oriented applications can improve scalability, throughput, latency and CPU utilization. In this talk we are going to present a new RDMA solution for Apache Spark that shows amazing improvements in multiple Spark use cases. The solution is under development in our labs, and is going to be released to the public as an open-source plug-in.
Session hashtag: #EUres3

About Yuval Degani

Yuval Degani is a Senior Manager of Engineering in Mellanox Technologies, leading a team of Software Engineers based in the San Francisco Bay Area. His team's focus is introducing new network acceleration technologies to Big Data and Machine Learning frameworks. Before that, Yuval was a developer, an architect and later a team lead in the areas of low-level kernel development for cutting edge high-performance network devices. Yuval holds a BSc in Computer Science from the Technion Institute of Technology, Israel.