Skip to main content
<
Page 59
>

Retail and Consumer Goods Sessions You Don’t Want to Miss at Spark + AI Summit 2020

June 16, 2020 by Rob Saker and Hector Leano in
The current economic environment is having a significant impact on the Retail and Consumer Goods sector. Rapid changes in how consumers shop is...

Simplify Data Conversion from Apache Spark to TensorFlow and PyTorch

June 16, 2020 by Liang Zhang and Weichen Xu in
Petastorm is a popular open-source library from Uber that enables single machine or distributed training and evaluation of deep learning models from datasets...

Accelerating Somatic Variant Calling with the Databricks TNSeq Pipeline

Genetic analyses are a critical tool in revolutionizing how we treat cancer. By understanding the mutations present in tumor cells, researchers can gain...

Enterprise Cloud Service Public Preview on AWS

June 12, 2020 by Vinay Wagh and Abhinav Garg in
At Databricks, we have had the opportunity to collaborate with companies that have transformed the way people live. Some of our customers have...

A Guide to MLflow Talks at Spark + AI Summit 2020

June 12, 2020 by Cyrielle Simeone in
It's been 2 years since we originally launched MLflow , an open source platform for the full machine learning lifecycle, and we are...

Modernizing Risk Management Part 2: Aggregations, Backtesting at Scale and Introducing Alternative Data

June 5, 2020 by Antoine Amend in
Understanding and mitigating risk is at the forefront of any financial services institution. However, as previously discussed in the first blog of this...

Automate continuous integration and continuous delivery on Databricks using Databricks Labs CI/CD Templates

CONTENTS Overview Why do we need yet another deployment framework? Simplifying CI/CD on Databricks via reusable templates Development lifecycle using Databricks Deployments How...

Customer Lifetime Value Part 1: Estimating Customer Lifetimes

Download the Customer Lifetimes Part 1 notebook to demo the solution covered below, and watch the on-demand virtual workshop to learn more. You...

Monitor Your Databricks Workspace with Audit Logs

June 2, 2020 by Craig Ng and Miklos Christine in
Cloud computing has fundamentally changed how companies operate - users are no longer subject to the restrictions of on-premises hardware deployments such as...

Vectorized R I/O in Upcoming Apache Spark 3.0

June 1, 2020 by Hyukjin Kwon in
R is one of the most popular computer languages in data science, specifically dedicated to statistical analysis with a number of extensions, such...