Open Source | Databricks Blog

Page 32

Announcing Apache Spark 1.0

Today, we’re very proud to announce the release of Apache Spark 1.0 . Apache Spark 1.0 is a major milestone for the Spark...

One of Apache Spark’s main goals is to make big data applications easier to write. Spark has always had concise APIs in Scala...

We are happy to announce the availability of Apache Spark 0.9.1 ! This is a maintenance release with bug fixes, performance improvements, better...

Read Rise of the Data Lakehouse to explore why lakehouses are the data architecture of the future with the father of the data...

This article was cross-posted in the Cloudera developer blog . Apache Spark is well known today for its performance benefits over MapReduce...

We are delighted with the recent announcement of the Apache Software Foundation that Apache Spark has become a top-level Apache project. This is...

The AMPLab at UC Berkeley, with help from Databricks, recently released an update to the Big Data Benchmark . This benchmark uses Amazon...

Our goal with Apache Spark is very simple: provide the best platform for computation on big data. We do this through both a...

January 21, 2014 by Ion Stoica in Engineering

We are often asked how does Apache Spark fits in the Hadoop ecosystem , and how one can run Spark in a existing...

Apache Hadoop integration has always been a key goal of Apache Spark and YARN users have long been able to run Spark on...