How Databricks improved query performance by up to 2.2x by automatically optimizing file sizesMay 22, 2023 by Sirui Sun, Himanshu Raja, Vijayan Prabhakaran, Terry Kim, Bart Samwel, Rahul Mahadev, Rajesh Parangi Sharabhalingappa, Rahul Potharaju and Kam Cheung Ting in Platform Blog Optimizing tables has long been a necessary but complicated task for data engineers. One particularly thorny area has been getting to the optimal...
Announcing the Delta Lake 0.3.0 ReleaseAugust 2, 2019 by Tathagata Das, Rahul Mahadev, Zhitong Yan and Prakash Chockalingam in Engineering Blog Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. We are excited to...