Engineering | Databricks Blog

Page 33

How to Manage Python Dependencies in PySpark

December 22, 2020 by Hyukjin Kwon in Engineering

Controlling the environment of an application is often challenging in a distributed computing environment - it is difficult to ensure all nodes have...

Natively Query Your Delta Lake With Scala, Java, and Python

December 22, 2020 by Shixiong Zhu, Scott Sandre and Denny Lee in Engineering

Today, we’re happy to announce that you can natively query your Delta Lake with Scala and Java (via the Delta Standalone Reader) and...

Personalizing the Customer Experience with Recommendations

December 18, 2020 by Rob Saker, Bryan Smith, Bilaji Raman, Ye Wang, Yiyan Zhang and Terry Tang in Engineering

Go directly to the Recommendation notebooks referenced throughout this post . Retail made a giant leap forward in the adoption of e-commerce in...

A Step-by-step Guide for Debugging Memory Leaks in Spark Applications

December 16, 2020 by Shivansh Srivastava in Engineering

This is a guest authored post by Shivansh Srivastava, software engineer, Disney Streaming Services. It was originally published on Medium.com Just a bit...

Handling Late Arriving Dimensions Using a Reconciliation Pattern

December 15, 2020 by Chaitanya Chandurkar in Company

This is a guest community post authored by Chaitanya Chandurkar , Senior Software Engineer in the Analytics and Reporting team at McGraw Hill...

Python Autocomplete Improvements for Databricks Notebooks

December 15, 2020 by Richard Fung, Xinrong Meng, Takuya Ueshin, Hyukjin Kwon and Austin Ford in Engineering

At Databricks, we strive to provide a world-class development experience for data scientists and engineers, and new features are constantly getting added to...

Learn How Disney+ Built Their Streaming Data Analytics Platform With Databricks and AWS to Improve the Customer Experience

December 14, 2020 by Hector Leano in Data Streaming

https://youtu.be/WAOrqsHpJuM Martin Zapletal, Software Engineering Director at Disney+, is presenting at re:Invent 2020 with the session "How Disney+ uses fast data ubiquity to...