
Managing and Securing Credentials in Databricks for Apache Spark Jobs

June 20, 2017 by Jason Pohl
Since Apache Spark separates compute from storage, every Spark Job requires a set of credentials to connect to disparate data sources. Storing those...

Analysing Metro Operations Using Apache Spark on Databricks

This is a guest blog from the EY Advisory Data & Analytics team, who have been working with Sporveien in Oslo building a platform...

Databricks Serverless: Next Generation Resource Management for Apache Spark

As the amount of data in an organization grows, more and more engineers, analysts and data scientists need to analyze this data using...

Sharing Knowledge with the Community in a Preview of Apache Spark: The Definitive Guide

Apache Spark has seen immense growth over the past several years. The size and scale of this Spark Summit is a true reflection...

Integrating Apache Spark with Cucumber for Behavioral-Driven Development

June 2, 2017 by Aaron Colcord and Zachary Nanfelt
This is a guest blog from FIS Global. One of the most difficult scenarios in data processing is ensuring that the data is...

Apache Spark Cluster Monitoring with Databricks and Datadog

June 1, 2017 by Caryl Yuhas and Ilan Rabinovitch
This blog post is a joint effort between Caryl Yuhas, Databricks’ Solutions Architect, and Ilan Rabinovitch, Datadog’s Director of Technical Community and Evangelism...

Transactional Writes to Cloud Storage on Databricks

In another blog post published today, we showed the top five reasons for choosing S3 over HDFS. With the dominance of simple...

Top 5 Reasons for Choosing S3 over HDFS

May 31, 2017 by Reynold Xin, Josh Rosen and Kyle Pistor
At Databricks, our engineers guide thousands of organizations to define their big data and cloud strategies. When migrating big data workloads to the...

Entropy-based Log Redaction for Apache Spark on Databricks

May 30, 2017 by Weiluo Ren and Yu Peng
This blog post is part of our series of internal engineering blogs on Databricks platform, infrastructure management, tooling, monitoring, and provisioning. We love...

Using sparklyr in Databricks

May 25, 2017 by Hossein Falaki
Try this notebook on Databricks, with all instructions as explained in this post. In September 2016, RStudio announced sparklyr, a new...