Apache Spark in Cloud and Hybrid: Why Security and Governance Become More Important

An increasing number of Apache Spark deployments run in cloud and hybrid environments. In these deployments, the Spark workloads are often ephemeral, while the data lives in durable storage, either in the cloud or on-prem, and moves between the two. With this architecture, security and governance become paramount for running Spark workloads across on-prem and cloud. In this keynote, we will walk through several of the issues involved and highlight a Spark workload running in an ephemeral cluster with security and governance applied across cloud and on-prem, and show how the same security and governance are shared with other workloads.