Databricks Blog

Page 183

Accelerating R Workflows on Databricks

October 6, 2017 by Hossein Falaki in Engineering

At Databricks we strive to make our Unified Analytics Platform the best place to run big data analytics. For big data, Apache Spark...

Building Complex Data Pipelines with Unified Analytics Platform

October 5, 2017 by Jules Damji and Jason Pohl in Platform

Introduction Big data practitioners often post recurring questions on Quora: What is data engineering? How to become a data scientist? What’s a data...

Bay Area Apache Spark Meetup at HPE/Aruba Networks Summary

September 22, 2017 by Jules Damji in Company

On September 7th, we held our monthly Bay Area Apache Spark Meetup (BASM) at HPE/Aruba Networks in Santa Clara. We had two Apache...

Learn about Apache Spark’s Memory Model and Spark’s State in the Cloud

September 19, 2017 by Wenchen Fan and Nicolas Poggi in Company

Since Apache Spark 1.6, as part of the Project Tungsten , we started an ongoing effort to substantially improve the memory and CPU...

Databricks invites Colleen Lewis to Speak about Diversity in the Workplace

September 15, 2017 by Angelos Mikelatos in Announcements

First I'll start with the sad truth. The technology industry at large has taken many hits over the years for discriminatory practices and...

Looker and Databricks Partner to Bring Data Scientists and Business Users Together

September 14, 2017 by Brian Dirking in Company

We are very excited today as we announce a partnership between Databricks and Looker. We have seen customers using these products together to...

Learn about Apache Spark APIs and Best Practices

September 12, 2017 by Jules Damji and Silvio Fiorito in Company

Since Apache Spark 1.3, Spark and its APIs have evolved to make them easier, faster, and smarter. The goal has been to unify...

Build, Scale, and Deploy Deep Learning Pipelines with Ease

September 6, 2017 by Sue Ann Hong and Tim Hunter in Announcements

At the Spark Summit in San Francisco in June , we announced an open-source project Deep Learning Pipelines . Deep Learning Pipelines provides...

A Summer of Personal and Professional Growth at Databricks

September 5, 2017 by Karen Feng in Company

This summer, I worked at Databricks as a software engineering intern on the Growth team. By introducing two new features, user groups and...

Do your Streaming ETL at Scale with Apache Spark’s Structured Streaming

September 1, 2017 by Tathagata Das in Announcements

At the Spark Summit in San Francisco in June , we announced that Apache Spark’s Structured Streaming is marked as production-ready and shared...