Articles by Joseph Bradley - Databricks Blog

Page 2

Detecting Abuse at Scale: Locality Sensitive Hashing at Uber Engineering

May 9, 2017 by Yun Ni, Kelvin Chu and Joseph Bradley in Solutions

This is a cross blog post effort between Databricks and Uber Engineering. Yun Ni is a software engineer on Uber’s Machine Learning Platform...

Intel’s BigDL on Databricks

February 8, 2017 by Sue Ann Hong and Joseph Bradley in Engineering Blog

Try this notebook on Databricks Intel recently released its BigDL project for distributed deep learning on Apache Spark. BigDL has native Spark integration...

Deep Learning on Databricks

December 21, 2016 by Joseph Bradley and Tim Hunter in Engineering Blog

We are excited to announce the general availability of Graphic Processing Unit (GPU) and deep learning support on Databricks! This blog post will...

On Demand Webinar and FAQ: Apache Spark MLlib 2.x: Migrating ML Workloads to DataFrames

December 14, 2016 by Joseph Bradley and Jules Damji in Company Blog

Last week, we held a live webinar, Apache Spark MLlib 2.x: Migrating ML Workloads to DataFrames , to demonstrate the ease with which...

GPU Acceleration in Databricks

October 26, 2016 by Joseph Bradley, Tim Hunter and Yandong Mao in Engineering Blog

Databricks is adding support for Apache Spark clusters with Graphics Processing Units (GPUs), ready to accelerate Deep Learning workloads. With Spark deployments tuned...

Apache Spark 2.0 Preview: Machine Learning Model Persistence

May 31, 2016 by Joseph Bradley in Engineering Blog

Introduction Consider these Machine Learning (ML) use cases: A data scientist produces an ML model and hands it over to an engineering team...

Approximate Algorithms in Apache Spark: HyperLogLog and Quantiles

May 19, 2016 by Tim Hunter, Hossein Falaki and Joseph Bradley in Solutions

Introduction Apache Spark is fast, but applications such as preliminary data exploration need to be even faster and are willing to sacrifice some...

On-Time Flight Performance with GraphFrames for Apache Spark

March 15, 2016 by Joseph Bradley, Bill Chambers and Denny Lee in Engineering Blog

Introduction Graph structures are a more intuitive approach to many classes of data problems. Whether traversing social networks, restaurant recommendations, or flight paths...

Introducing GraphFrames

March 3, 2016 by Ankur Dave, Joseph Bradley and Tim Hunter in Engineering Blog

We would like to thank Ankur Dave from UC Berkeley AMPLab for his contribution to this blog post. Databricks is excited to announce...

Auto-scaling scikit-learn with Apache Spark

February 8, 2016 by Tim Hunter and Joseph Bradley in Engineering Blog

Data scientists often spend hours or days tuning models to get the highest accuracy. This tuning typically involves running a large number of...