Hosted Spark
What is Hosted Spark?
Apache Spark is a fast, general-purpose cluster computing system for big data, designed around speed, ease of use, and advanced analytics. It was originally developed in 2009 at UC Berkeley. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a set of higher-level tools: Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Spark Streaming for stream processing.

Spark Provides Two Modes for Data Exploration:
- Interactive
- Batch

Hosted Spark Services Provide These Features:
- Interactive Scala, Python, and R shells
- Batch submissions in Scala, Java, and Python
- Multiple users can share the same server
- Users can submit jobs from anywhere through REST
- No code changes to your programs are required
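
As a rough illustration of the REST-based submission described above, the sketch below builds (but does not send) the HTTP requests for a batch job and for an interactive session. The entry does not name a specific service, so this assumes a Livy-style REST interface with `/batches`, `/sessions`, and `/sessions/{id}/statements` endpoints. The host name, port, file path, and class name are hypothetical placeholders, and the helper functions are invented for this example.

```python
import json
from urllib.request import Request

# Hypothetical server address (assumption): replace with your hosted Spark endpoint.
BASE_URL = "http://spark-host.example.com:8998"


def build_post(path, payload, base_url=BASE_URL):
    """Build, without sending, a JSON POST request for the given path."""
    body = json.dumps(payload).encode("utf-8")
    return Request(
        url=base_url.rstrip("/") + path,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def batch_request(app_file, class_name=None, args=()):
    """Batch mode: submit an already packaged application, unchanged."""
    payload = {"file": app_file, "args": list(args)}
    if class_name is not None:
        payload["className"] = class_name
    return build_post("/batches", payload)


def session_request(kind="pyspark"):
    """Interactive mode: ask the server to start a remote shell."""
    return build_post("/sessions", {"kind": kind})


def statement_request(session_id, code):
    """Interactive mode: run one snippet of code in an existing session."""
    return build_post(f"/sessions/{session_id}/statements", {"code": code})


if __name__ == "__main__":
    # Hypothetical jar path and class name, shown only to print the payload.
    req = batch_request("hdfs:///jobs/wordcount.jar", "com.example.WordCount", ["in", "out"])
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

To actually submit a request, pass it to `urllib.request.urlopen` against a reachable server. Keeping request construction separate from sending makes the payloads easy to inspect before anything runs on the cluster.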