Skip to main content
<
Page 110
>

A Tale About Vulnerability Research and Early Detection

February 3, 2022 by Fermín J. Serna and Yanir Tsarimi in
This is a collaborative post between Databricks and Orca Security. We thank Yanir Tsarimi, Cloud Security Researcher, of Orca Security for their contribution...

Google Datastream Integration With Delta Lake for Change Data Capture

This is a collaborative post between the data teams as Badal, Google and Databricks. We thank Eugene Miretsky, Partner, and Steven Deutscher-Kobayashi, Senior...

Scaling SHAP Calculations With PySpark and Pandas UDF

February 2, 2022 by Sepideh Ebrahimi and P. Patel in
Motivation With the proliferation of applications of Machine Learning (ML) and especially Deep Learning (DL) models in decision making, it is becoming more...

Streamline MLOps With MLflow Model Registry Webhooks

As machine learning becomes more widely adopted, businesses need to deploy models at speed and scale to achieve maximum value. Today, we are...

Make Your Data Lakehouse Run, Faster With Delta Lake 1.1

Delta Lake 1.1 improves performance for merge operations, adds the support for generated columns and improves nested field resolution With the tremendous contributions...

The Ubiquity of Delta Standalone: Java, Scala, Hive, Presto, Trino, Power BI, and More!

The Delta Standalone library is a single-node Java library that can be used to read from and write to Delta tables. Specifically, this...

Orchestrating Databricks Workloads on AWS With Managed Workflows for Apache Airflow

January 27, 2022 by Naseer Ahmed and Igor Alekseev in
In this blog, we explore how to leverage Databricks’ powerful jobs API with Amazon Managed Apache Airflow (MWAA) and integrate with Cloudwatch to...

Investing in TickSmith: Enabling an E-Commerce Data Experience With Open Data Exchange

January 27, 2022 by Itai Weiss, Jay Bhankharia and Andrew Ferguson in
We are excited to announce Databricks Ventures’ investment in TickSmith, a leading SaaS platform that simplifies the online data shopping experience. The investment...

Creating a Faster TAR Extractor

January 26, 2022 by Christopher Denny in
Tarballs are used industry-wide for packaging and distributing files, and this is no different at Databricks. Every day we launch millions of VMs...

Building Data Applications on the Lakehouse With the Databricks SQL Connector for Python

We are excited to announce General Availability of the Databricks SQL Connector for Python . This follows the recent General Availability of Databricks...