Skip to main content
<
Page 118
>

How Building Apache Zeppelin Led Me to Databricks

August 12, 2021 by Moonsoo Lee in
Today, I am excited to announce that I have officially joined Databricks as an Engineer on the Data Science team. This move comes...

Announcing the Databricks Beacons Program

August 12, 2021 by Karen Bajza in
With roots in academia and open source, we know much of Databricks’ success is due to the community- the data scientists, data engineers...

How We Achieved High-bandwidth Connectivity With BI Tools

Databricks SQL is now generally available on AWS and Azure. Business Intelligence (BI) tools such as Tableau and Microsoft Power BI are notoriously...

Introducing Support for gp3, Amazon’s New General Purpose SSD Volume

August 10, 2021 by Robert Saxby in
Databricks clusters on AWS now support gp3 volumes , the latest generation of Amazon Elastic Block Storage (EBS) general purpose SSDs. gp3 volumes...

5 Key Steps to Successfully Migrate From Hadoop to the Lakehouse Architecture

August 6, 2021 by Harsh Narula in
The decision to migrate from Hadoop to a modern cloud-based architecture like the lakehouse architecture is a business decision, not a technology decision...

How We Built Databricks on Google Kubernetes Engine (GKE)

August 6, 2021 by Frank Munz and Li Gao in
Our release of Databricks on Google Cloud Platform (GCP) was a major milestone toward a unified data, analytics and AI platform that is...

An Experimentation Pipeline for Extracting Topics From Text Data Using PySpark

This post is part of a series of posts on topic modeling. Topic modeling is the process of extracting topics from a set...

Databricks Lecture Series at UC Berkeley School of Information

July 29, 2021 by Rob Reed and Tia Foss in
This is a collaborative post from Databricks and UC Berkeley. We thank Tia Foss, Director of Philanthropy, UC Berkeley School of Information, for...

Augment Your SIEM for Cybersecurity at Cloud Scale

July 23, 2021 by Michael Ortega and Monzy Merza in
Over the last decade, security incident and event management tools (SIEMs) have become a standard in enterprise security operations. SIEMs have always had...

Getting Started With Ingestion into Delta Lake

July 23, 2021 by John O'Dwyer and Nancy Shah in
Ingesting data can be hard and complex since you either need to use an always-running streaming platform like Kafka or you need to...