A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets

Of all the developers’ delight, none is more attractive than a set of APIs that make developers productive, that is easy to use, and that is intuitive and expressive. One of Apache Spark’s appeal to developers has been its easy-to-use APIs, for operating on large datasets, across languages: Scala, Java, Python, and R. In this … Continue reading A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets