Skip to main content
<
Page 115
>

Pandas API on Upcoming Apache Spark™ 3.2

October 4, 2021 by Hyukjin Kwon and Xinrong Meng in
We're thrilled to announce that the pandas API will be part of the upcoming Apache Spark™ 3.2 release. pandas is a powerful, flexible...

Databricks SQL: Delivering a Production SQL Development Experience on the Data Lake

September 30, 2021 by Miranda Luna and Cyrielle Simeone in
Databricks SQL is now generally available on AWS and Azure. Databricks SQL (DB SQL) is a simple and powerful SQL analytics platform for...

Interning From a Distance

September 27, 2021 by Inbar Gam and Robin Lee in
Summer 2021 brought another summer of virtual game nights, pizza parties and team-building events for Databricks interns. In addition to working on impactful...

Shiny and Environments for R Notebooks

At Databricks, we want the Lakehouse ecosystem widely accessible to all data practitioners, and R is a great interface language for this purpose...

Catalog and Discover Your Databricks Notebooks Faster

September 22, 2021 by Darin McBeath and Vuong Nguyen in
This is a collaborative post from Databricks and Elsevier. We thank Darin McBeath, Director Disruptive Technologies -- Elsevier, for his contributions. As a...

Extracting Oncology Insights From Real-World Clinical Data With NLP

Preview the solution accelerator notebooks referenced in this blog online or get started right away by downloading and importing the notebooks into your...

Managing Model Ensembles With MLflow

In machine learning, an ensemble is a collection of diverse models that provide more predictive power together than any single model would on...

How YipitData Extracts Insights From Alternative Data Using Delta Lake

September 21, 2021 by Anup Segu and Bobby Muldoon in
Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. This is a guest...

Part 1: Implementing CI/CD on Databricks Using Databricks Notebooks and Azure DevOps

September 20, 2021 by Michael Shtelma and Piotr Majer in
Discussed code can be found here . This is the first part of a two-part series of blog posts that show how to...

Timeliness and Reliability in the Transmission of Regulatory Reports

September 17, 2021 by Antoine Amend and Fahmid Kabir in
Managing risk and regulatory compliance is an increasingly complex and costly endeavour. Regulatory change has increased 500% since the 2008 global financial crisis...