Skip to main content
<
Page 173
>

Introducing mlflow-apps: A Repository of Sample Applications for MLflow

August 16, 2018 by Juntai Zheng in
Introduction This summer, I was a software engineering intern at Databricks on the Machine Learning (ML) Platform team. As part of my intern...

100x Faster Bridge between Apache Spark and R with User-Defined Functions on Databricks

August 15, 2018 by Liang Zhang and Hossein Falaki in
SparkR User-Defined Function (UDF) API opens up opportunities for big data workloads running on Apache Spark to embrace R's rich package ecosystem. Some...

Building a Real-Time Attribution Pipeline with Databricks Delta

August 9, 2018 by Caryl Yuhas and Denny Lee in
Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. In digital advertising, one...

Loan Risk Analysis with XGBoost and Databricks Runtime for Machine Learning

August 9, 2018 by Amy Wang and Denny Lee in
Try this notebook series in Databricks For companies that make money off of interest on loans held by their customer, it’s always about...

MLflow 0.4.2 Released

August 8, 2018 by Aaron Davidson and Denny Lee in
Today, we’re excited to announce MLflow v0.4.0, MLflow v0.4.1, and v0.4.2 which we released within the last week with some of the recently...

A Guide to Data Science, Developer, and Deep Dive Talks at Spark + AI Summit Europe

August 7, 2018 by Jules Damji in
In October 2012, Harvard Business Review put a spotlight on the data science career with a dedicated issue and a catchy claim: Data...

Get Certified on Apache Spark™ with Databricks

August 3, 2018 by Donna Weber in
In a world of rapidly changing products, companies investing in technology need well-trained experts to run it. Certifications are a key differentiator in...

Processing Petabytes of Data in Seconds with Databricks Delta

July 31, 2018 by Adrian Ionescu in
Introduction Databricks Delta Lake is a unified data management system that brings data reliability and fast analytics to cloud data lakes . In...

rquery: Practical Big Data Transforms for R-Spark Users

July 26, 2018 by Nina Zumel and John Mount in
This is a guest community blog from Nina Zumel and John Mount , data scientists and consultants at Win-Vector . They share how...

Bay Area Apache Spark Meetup Summary @ Databricks HQ

July 25, 2018 by Jules Damji in
On July 19, we held our monthly Bay Area Spark Meetup (BASM) at Databricks, HQ in San Francisco. At the Spark + AI...