Skip to main content
<
Page 132
>

Lakehouse Architecture Realized: Enabling Data Teams With Faster, Cheaper and More Reliable Open Architectures

January 8, 2021 by Ryan Boyd in
Databricks was founded under the vision of using data to solve the world’s toughest problems. We started by building upon our open source...

Bayesian Modeling of the Temporal Dynamics of COVID-19 Using PyMC3

In this post, we look at how to use PyMC3 to infer the disease parameters for COVID-19. PyMC3 is a popular probabilistic programming...

How to Manage Python Dependencies in PySpark

December 22, 2020 by Hyukjin Kwon in
Controlling the environment of an application is often challenging in a distributed computing environment - it is difficult to ensure all nodes have...

Natively Query Your Delta Lake With Scala, Java, and Python

Today, we’re happy to announce that you can natively query your Delta Lake with Scala and Java (via the Delta Standalone Reader) and...

Personalizing the Customer Experience with Recommendations

Go directly to the Recommendation notebooks referenced throughout this post . Retail made a giant leap forward in the adoption of e-commerce in...

A Step-by-step Guide for Debugging Memory Leaks in Spark Applications

December 16, 2020 by Shivansh Srivastava in
This is a guest authored post by Shivansh Srivastava, software engineer, Disney Streaming Services. It was originally published on Medium.com Just a bit...

Top Questions from Our Lakehouse Event

December 16, 2020 by Sam Steiny in
We recently held a virtual event , featuring CEO Ali Ghodsi, that showcased the vision of Lakehouse architecture and how Databricks helps customers...

Handling Late Arriving Dimensions Using a Reconciliation Pattern

December 15, 2020 by Chaitanya Chandurkar in
This is a guest community post authored by Chaitanya Chandurkar , Senior Software Engineer in the Analytics and Reporting team at McGraw Hill...

Python Autocomplete Improvements for Databricks Notebooks

At Databricks, we strive to provide a world-class development experience for data scientists and engineers, and new features are constantly getting added to...

Learn How Disney+ Built Their Streaming Data Analytics Platform With Databricks and AWS to Improve the Customer Experience

December 14, 2020 by Hector Leano in
https://youtu.be/WAOrqsHpJuM Martin Zapletal, Software Engineering Director at Disney+, is presenting at re:Invent 2020 with the session "How Disney+ uses fast data ubiquity to...