Apache Spark’s Structured Streaming with Amazon Kinesis on Databricks

August 9, 2017 by Jules Damji
On July 11, 2017, we announced the general availability of Apache Spark 2.2.0 as part of Databricks Runtime 3.0 (DBR) for the Unified...

Databricks Named as a Strong Performer in The Forrester Wave: Insight Platforms-as-a-Service, Q3 2017

August 8, 2017 by Bharath Gowda
Forrester recently published The Forrester Wave: Insight Platforms-as-a-Service Wave, Q3 2017. In its 36-criteria evaluation of insight platform-as-a-service (PaaS) providers, Forrester identified...

On-Demand Webinar and FAQ: Accelerate Data Science with Better Data Engineering on Databricks

On July 13th, we hosted a live webinar: Accelerate Data Science with Better Data Engineering on Databricks. This webinar focused on...

Breaking the “curse of dimensionality” in Genomics using “wide” Random Forests

This is a guest blog from members of CSIRO’s transformational bioinformatics team in Sydney, Australia. CSIRO, Australia’s government research agency, is in the...

Integrating Apache Airflow with Databricks

July 19, 2017 by Andrew Chen
This blog post is part of our series of internal engineering blogs on Databricks platform, infrastructure management, integration, tooling, monitoring, and provisioning. Today...

Serverless Continuous Delivery with Databricks and AWS CodePipeline

July 13, 2017 by Kevin Rasmussen
Two characteristics commonly mark many companies' success. First, they quickly adapt to new technology. Second, as a result, they gain technological leadership and...

4 SQL High-Order and Lambda Functions to Examine Complex and Structured Data in Databricks

June 27, 2017 by Jules Damji

Declarative Infrastructure with the Jsonnet Templating Language

June 26, 2017 by Eric Liang and Aaron Davidson
This blog post is part of our series of internal engineering blogs on Databricks platform, infrastructure management, integration, tooling, monitoring, and provisioning. At...

Shell Oil Use Case: Parallelizing Large Simulations with Apache SparkR on Databricks

This blog post is a joint engineering effort between Shell’s Data Science Team (Wayne W. Jones and Dennis Vallinga) and Databricks...

Managing and Securing Credentials in Databricks for Apache Spark Jobs

June 20, 2017 by Jason Pohl
Since Apache Spark separates compute from storage, every Spark Job requires a set of credentials to connect to disparate data sources. Storing those...