Platform | Databricks Blog

Page 84

Introducing the Natural Language Processing Library for Apache Spark

October 19, 2017 by David Talby in Solutions

This is a community blog and effort from the engineering team at John Snow Labs, explaining their contribution to an open-source Apache Spark...

3 Things CISO’s expect from Tech Companies in a Cloudy World

October 17, 2017 by David Cook in Company

Adding new software to an enterprise is a difficult process. In the past, choosing new software only required budget approval before it could...

Building Complex Data Pipelines with Unified Analytics Platform

October 5, 2017 by Jules Damji and Jason Pohl in Platform

Introduction Big data practitioners often post recurring questions on Quora: What is data engineering? How to become a data scientist? What’s a data...

Databricks invites Colleen Lewis to Speak about Diversity in the Workplace

September 15, 2017 by Angelos Mikelatos in Announcements

First I'll start with the sad truth. The technology industry at large has taken many hits over the years for discriminatory practices and...

Looker and Databricks Partner to Bring Data Scientists and Business Users Together

September 14, 2017 by Brian Dirking in Company

We are very excited today as we announce a partnership between Databricks and Looker. We have seen customers using these products together to...

Learn about Apache Spark APIs and Best Practices

September 12, 2017 by Jules Damji and Silvio Fiorito in Company

Since Apache Spark 1.3, Spark and its APIs have evolved to make them easier, faster, and smarter. The goal has been to unify...

Build, Scale, and Deploy Deep Learning Pipelines with Ease

September 6, 2017 by Sue Ann Hong and Tim Hunter in Announcements

At the Spark Summit in San Francisco in June , we announced an open-source project Deep Learning Pipelines . Deep Learning Pipelines provides...

A Summer of Personal and Professional Growth at Databricks

September 5, 2017 by Karen Feng in Company

This summer, I worked at Databricks as a software engineering intern on the Growth team. By introducing two new features, user groups and...

Do your Streaming ETL at Scale with Apache Spark’s Structured Streaming

September 1, 2017 by Tathagata Das in Announcements

At the Spark Summit in San Francisco in June , we announced that Apache Spark’s Structured Streaming is marked as production-ready and shared...

Best Practices for Coarse Grained Data Security in Databricks

August 23, 2017 by Bill Chambers and Jules Damji in Platform

At Databricks, we work with hundreds of companies, all pushing the bleeding edge in their respective industries. We want to share patterns for...