Engineering | Databricks Blog

Page 72

Distributing the Singular Value Decomposition with Apache Spark

Guest post by Li Pu from Twitter and Reza Zadeh from Databricks on their recent contribution to Apache Spark's machine learning library. The...

This post originally appeared in insideBIGDATA and is reposted here with permission. With the second Spark Summit behind us, we wanted to take...

MLlib is an Apache Spark component focusing on machine learning. It became a standard component of Spark in version 0.8 (Sep 2013). The...

With the introduction of Spark SQL and the new Hive on Apache Spark effort ( HIVE-7292 ), we get asked a lot about...

Read Rise of the Data Lakehouse to explore why lakehouses are the data architecture of the future with the father of the data...

Today, we’re very proud to announce the release of Apache Spark 1.0 . Apache Spark 1.0 is a major milestone for the Spark...

One of Apache Spark’s main goals is to make big data applications easier to write. Spark has always had concise APIs in Scala...

We are happy to announce the availability of Apache Spark 0.9.1 ! This is a maintenance release with bug fixes, performance improvements, better...

Read Rise of the Data Lakehouse to explore why lakehouses are the data architecture of the future with the father of the data...

This article was cross-posted in the Cloudera developer blog . Apache Spark is well known today for its performance benefits over MapReduce...