Skip to main content
Page 1

How Databricks’ Data Team Built a Lakehouse Across 3 Clouds and 50+ Regions

July 14, 2021 by Jason Pohl and Suraj Acharya in
The internal logging infrastructure at Databricks has evolved over the years and we have learned a few lessons along the way about how...

Building a Cybersecurity Lakehouse for CrowdStrike Falcon Events

Get started now in your own Databricks deployment and run these notebooks . Endpoint data is required by security teams for threat detection...

Upgrade Production Workloads to Be Safer, Easier, and Faster With Databricks Runtime 7.3 LTS

March 8, 2021 by Jason Pohl in
What a difference a year makes. One year ago, Databricks Runtime version (DBR) 6.4 was released -- followed by 8 more DBR releases...

Top 5 Reasons to Convert Your Cloud Data Lake to a Delta Lake

August 21, 2020 by Jason Pohl in
Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. If you examine the...

Building Complex Data Pipelines with Unified Analytics Platform

October 5, 2017 by Jules Damji and Jason Pohl in
Introduction Big data practitioners often post recurring questions on Quora: What is data engineering? How to become a data scientist? What’s a data...

Managing and Securing Credentials in Databricks for Apache Spark Jobs

June 20, 2017 by Jason Pohl in
Since Apache Spark separates compute from storage, every Spark Job requires a set of credentials to connect to disparate data sources. Storing those...

Apache Spark Scala Library Development with Databricks

December 12, 2016 by Jason Pohl in
Try this notebook in Databricks The movie Toy Story was released in 1995 by Pixar as the first feature-length computer animated film. Even...