Skip to main content
<
Page 14
>

Open Sourcing Databricks Integration Tools at Edmunds

November 12, 2018 by Shaun Elliott and Sam Shuster in
This is a guest post from Shaun Elliott, Data Engineering Tech Lead and Sam Shuster, Staff Engineer at Edmunds. What is Databricks and...

Introducing Flint: A time-series library for Apache Spark

September 11, 2018 by Li Jin and Kevin Rasmussen in
This is a joint guest community blog by Li Jin at Two Sigma and Kevin Rasmussen at Databricks; they share how to use...

Announcing Databricks Runtime 4.2!

July 18, 2018 by Todd Greenstein in
We’re excited to announce Databricks Runtime 4.2, powered by Apache Spark™. Version 4.2 includes updated Spark internals, new features, and major performance upgrades...

Viacom’s Journey to Improving Viewer Experiences with Real-time Analytics at Scale

April 20, 2018 by Michael Ortega in
With over 4 billion subscribers, Viacom is focused on delivering amazing viewing experiences to their global audiences. Core to this strategy is ensuring...

Improving Threat Detection in a Big Data World

High-profile cybersecurity breaches dominated headlines in 2017. In the first half of the year, over 1.9B records were stolen . That’s more than...

Using Databricks to Democratize Big Data and Machine Learning at McGraw-Hill Education

October 18, 2017 by Matthew Hogan in
This is a guest post from Matt Hogan, Sr. Director of Engineering, Analytics and Reporting at McGraw-Hill Education. McGraw-Hill Education is a 129-year-old...

On-Demand Webinar and FAQ: Accelerate Data Science with Better Data Engineering on Databricks

On July 13th, we hosted a live webinar — Accelerate Data Science with Better Data Engineering on Databricks . This webinar focused on...

Shell Oil Use Case: Parallelizing Large Simulations with Apache SparkR on Databricks

This blog post is a joint engineering effort between Shell’s Data Science Team ( Wayne W. Jones and Dennis Vallinga ) and Databricks...

Analysing Metro Operations Using Apache Spark on Databricks

This is a guest blog from EY Advisory Data & Analytics team, who have been working with Sporveien in Oslo building a platform...

Take Reports From Concept to Production with PySpark and Databricks

Introduction: What is MediaMath? MediaMath is a demand-side media buying and data management platform. This means that brands and ad agencies can use...