Sharethrough Uses Apache Spark Streaming to Optimize Bidding in Real Time
March 25, 2014 by Russell Cardullo and Michael Ruggiero in Company Blog
We're very happy to see our friends at Cloudera continue to get the word out about Apache Spark, and their latest blog post...
Apache Spark: A Delight for Developers
March 20, 2014 by Jai Ranganathan and Matei Zaharia in Engineering Blog
This article was cross-posted in the Cloudera developer blog. Apache Spark is well known today for its performance benefits over MapReduce...
Apache Spark Now a Top-level Apache Project
March 2, 2014 by Ion Stoica in Engineering Blog
We are delighted with the recent announcement of the Apache Software Foundation that Apache Spark has become a top-level Apache project. This is...
AMPLab updates the Big Data Benchmark
February 12, 2014 by Ahir Reddy and Reynold Xin in Engineering Blog
The AMPLab at UC Berkeley, with help from Databricks, recently released an update to the Big Data Benchmark. This benchmark uses Amazon...
Apache Spark 0.9.0 Released
February 3, 2014 by Patrick Wendell in Engineering Blog
Our goal with Apache Spark is very simple: provide the best platform for computation on big data. We do this through both a...
Apache Spark and Hadoop: Working Together
January 21, 2014 by Ion Stoica in Engineering Blog
We are often asked how Apache Spark fits in the Hadoop ecosystem, and how one can run Spark in an existing...
Apache Spark In MapReduce (SIMR)
January 1, 2014 by Ali Ghodsi and Ahir Reddy in Engineering Blog
Apache Hadoop integration has always been a key goal of Apache Spark, and YARN users have long been able to run Spark on...
Apache Spark 0.8.1 Released
December 19, 2013 by Patrick Wendell in Engineering Blog
We are happy to announce the release of Apache Spark 0.8.1. In addition to performance and stability improvements, this release adds three new...
Highlights From Spark Summit 2013
December 18, 2013 by Andy Konwinski in Company Blog
Earlier this month we held the first Spark Summit, a conference to bring the Apache Spark community together. We are excited to share...
Putting Apache Spark to Use: Fast In-Memory Computing for Your Big Data Applications
November 21, 2013 by Pat McDonough in Engineering Blog
A version of this post appears on the Cloudera Blog. Apache Hadoop has revolutionized big data processing, enabling users to store and...