Restricting Libraries in JVM Compute Platforms

August 23, 2022 by Thomas Garnier
Security challenges with Scala and Java libraries: Open source communities have built incredibly useful libraries. They simplify many common development scenarios. Through our...

Feature Deep Dive: Watermarking in Apache Spark Structured Streaming

August 22, 2022 by Max Fisher
Key Takeaways: Watermarks help Spark understand the processing progress based on event time, when to produce windowed aggregates and when to trim the...

How to Migrate Your Data and AI Workloads to Databricks With the AWS Migration Acceleration Program

August 18, 2022 by Naseer Ahmed
In this blog we define the process for earning AWS customer credits when migrating Data and AI workloads to Databricks on Amazon Web...

Low-Code Exploratory Data Analysis with Bamboolib in Databricks

August 14, 2022 by Austin Ford
We are very excited to announce that the public preview of bamboolib in the Databricks Notebook begins today! It is available with the...

MLOps on Databricks with Vertex AI on Google Cloud

August 12, 2022 by Maggie Chu and Alexey Volkov
Since the launch of Databricks on Google Cloud in early 2021, Databricks and Google Cloud have been partnering together to further integrate the...

Orchestrating Data and ML Workloads at Scale: Create and Manage Up to 10k Jobs Per Workspace

Databricks Workflows is the fully-managed orchestrator for data, analytics, and AI. Today, we are happy to announce several enhancements that make it easier...

Low-latency Streaming Data Pipelines with Delta Live Tables and Apache Kafka

August 9, 2022 by Frank Munz
Delta Live Tables (DLT) is the first ETL framework that uses a simple declarative approach for creating reliable data pipelines and fully manages...

Databricks and Jupyter: Announcing ipywidgets in the Databricks Notebook

Today, we are excited to announce a deeper integration between the Databricks Notebook and the ecosystem established by Project Jupyter, a leader in...

Near Real-Time Anomaly Detection with Delta Live Tables and Databricks Machine Learning

Why is Anomaly Detection Important? Whether in retail, finance, cyber security, or any other industry, spotting anomalous behavior as soon as it happens...

Identity Columns to Generate Surrogate Keys Are Now Available in a Lakehouse Near You!

August 8, 2022 by Franco Patano
What is an identity column? An identity column is a column in a database that automatically generates a unique ID number for each...