Engineering | Databricks Blog

Introducing Apache Spark 2.1

December 28, 2016 by Reynold Xin in Engineering

Spark Summit will be held in Boston on Feb 7-9, 2017. Check out the full agenda and get your ticket before it sells...

10 Things I Wish I Knew Before Using Apache SparkR

December 28, 2016 by Neil Dewar in Engineering

This is a guest post from Neil Dewar, a senior data science manager at a global asset management firm. In this blog...

Deep Learning on Databricks

December 21, 2016 by Joseph Bradley and Tim Hunter in Engineering

We are excited to announce the general availability of Graphics Processing Unit (GPU) and deep learning support on Databricks! This blog post will...

Scalable Partition Handling for Cloud-Native Architecture in Apache Spark 2.1

December 15, 2016 by Eric Liang, Michael Allman and Wenchen Fan in Engineering

Apache Spark 2.1 is just around the corner: the community is going through the voting process for the release candidates. This blog post discusses...

Databricks Bi-Weekly Apache Spark Digest: 11/16/16

November 15, 2016 by Jules Damji in Engineering

Spark Summit Talks and Apache Spark Roundup: Databricks and partners set a new world record for CloudSort 2016 Benchmark using Apache Spark...

$1.44 per terabyte: setting a new world record with Apache Spark

November 14, 2016 by Reynold Xin in Engineering

We are excited to share with you that a joint effort by Nanjing University, Alibaba Group, and Databricks set a new world record...

GPU Acceleration in Databricks

October 26, 2016 by Joseph Bradley, Tim Hunter and Yandong Mao in Engineering

Databricks is adding support for Apache Spark clusters with Graphics Processing Units (GPUs), ready to accelerate Deep Learning workloads. With Spark deployments tuned...

Databricks Bi-Weekly Apache Spark Digest: 10/4/16

October 4, 2016 by Jules Damji in Engineering

Here’s our recap of what’s transpired with Apache Spark since our previous digest. Databricks Apache Spark Survey 2016 Report published and now...

Voice from CERN: Apache Spark 2.0 Performance Improvements Investigated With Flame Graphs

October 3, 2016 by Luca Canali in Engineering

This is a guest post from CERN, the European Organization for Nuclear Research. In this blog, Luca Canali of CERN investigates performance improvements...

Apache Spark @Scale: A 60 TB+ production use case from Facebook

August 31, 2016 by Sital Kedia, Shuojie Wang and Avery Ching in Solutions

This is a guest Apache Spark community blog from Facebook Engineering. In this technical blog, Facebook shares their usage of Apache Spark...