Open Source | Databricks Blog

Page 9

Modernizing Risk Management Part 2: Aggregations, Backtesting at Scale and Introducing Alternative Data

June 5, 2020 by Antoine Amend in Platform

Understanding and mitigating risk is at the forefront of any financial services institution. However, as previously discussed in the first blog of this...

Customer Lifetime Value Part 1: Estimating Customer Lifetimes

June 3, 2020 by Rob Saker, Bryan Smith, Bilal Obeidat and Chris Robison in Solutions

Download the Customer Lifetimes Part 1 notebook to demo the solution covered below, and watch the on-demand virtual workshop to learn more. You...

Monitor Your Databricks Workspace with Audit Logs

June 2, 2020 by Craig Ng and Miklos Christine in Platform

Cloud computing has fundamentally changed how companies operate - users are no longer subject to the restrictions of on-premises hardware deployments such as...

Vectorized R I/O in Upcoming Apache Spark 3.0

June 1, 2020 by Hyukjin Kwon in Platform

R is one of the most popular computer languages in data science, specifically dedicated to statistical analysis with a number of extensions, such...

Adaptive Query Execution: Speeding Up Spark SQL at Runtime

May 29, 2020 by Wenchen Fan, Herman van Hövell and MaryAnn Xue in Engineering

Read Rise of the Data Lakehouse to explore why lakehouses are the data architecture of the future with the father of the data...

Modernizing Risk Management Part 1: Streaming data-ingestion, rapid model development and Monte-Carlo Simulations at Scale

May 27, 2020 by Antoine Amend in Platform

Part 2 of this accelerator here . Managing risk within the financial services , especially within the banking sector, has increased in complexity...

Modernizing Risk Management Part 2: Aggregations, Backtesting at Scale and Introducing Alternative Data

Customer Lifetime Value Part 1: Estimating Customer Lifetimes

Monitor Your Databricks Workspace with Audit Logs

Vectorized R I/O in Upcoming Apache Spark 3.0

Adaptive Query Execution: Speeding Up Spark SQL at Runtime

Modernizing Risk Management Part 1: Streaming data-ingestion, rapid model development and Monte-Carlo Simulations at Scale

New Pandas UDFs and Python Type Hints in the Upcoming Release of Apache Spark 3.0

Manage and Scale Machine Learning Models for IoT Devices

Shrink Training Time and Cost Using NVIDIA GPU-Accelerated XGBoost and Apache Spark™ on Databricks

Now on Databricks: A Technical Preview of Databricks Runtime 7 Including a Preview of Apache Spark 3.0