Skip to main content
<
Page 28
>

How We Achieved High-bandwidth Connectivity With BI Tools

Databricks SQL is now generally available on AWS and Azure. Business Intelligence (BI) tools such as Tableau and Microsoft Power BI are notoriously...

How We Built Databricks on Google Kubernetes Engine (GKE)

August 6, 2021 by Frank Munz and Li Gao in
Our release of Databricks on Google Cloud Platform (GCP) was a major milestone toward a unified data, analytics and AI platform that is...

An Experimentation Pipeline for Extracting Topics From Text Data Using PySpark

This post is part of a series of posts on topic modeling. Topic modeling is the process of extracting topics from a set...

Getting Started With Ingestion into Delta Lake

July 23, 2021 by John O'Dwyer and Nancy Shah in
Ingesting data can be hard and complex since you either need to use an always-running streaming platform like Kafka or you need to...

The Delta Between ML Today and Efficient ML Tomorrow

Delta Lake and MLflow both come up frequently in conversation but often as two entirely separate products. This blog will focus on the...

Monitoring ML Models With Model Assertions

This is a guest post from the Stanford University Computer Science Department. We thank Daniel Kang, Deepti Raghavan and Peter Bailis of Stanford...

Unlocking the Power of Health Data With a Modern Data Lakehouse

A single patient produces approximately 80 megabytes of medical data every year. Multiply that across thousands of patients over their lifetime, and you're...

AML Solutions at Scale Using Databricks Lakehouse Platform

Anti-Money Laundering (AML) compliance has been undoubtedly one of the top agenda items for regulators providing oversight of financial institutions across the globe...

Feature Engineering at Scale

July 16, 2021 by Li Yu and Daniel Tomes in
Feature engineering is one of the most important and time-consuming steps of the machine learning process. Data scientists and analysts often find themselves...

Driving Transformation at Northwestern Mutual (Insights Platform) by Moving Towards a Scalable, Open Lakehouse Architecture

July 15, 2021 by Madhu Kotian in
This is a guest authored post by Madhu Kotian, Vice President of Engineering (Investment Products Data, CRM, Apps and Reporting) at Northwestern Mutual...