Koalas: Easy Transition from pandas to Apache Spark

April 24, 2019 by Tony Liu, Tim Hunter and Cyrielle Simeone
Today at Spark + AI Summit, we announced Koalas, a new open source project that augments PySpark’s DataFrame API to make it compatible...

Managing the Complete Machine Learning Lifecycle: On-Demand Webinar now available!

April 4, 2019 by Andy Konwinski
On March 7th, our team hosted a live webinar, Managing the Complete Machine Learning Lifecycle, with Andy Konwinski, Co-Founder and VP of Product...

Introducing Databricks AWS IAM Credential Passthrough

March 26, 2019 by Silvio Fiorito and Greg Owen
As more and more analytics move to the cloud, customers are faced with the challenge of how to control which users have access...

Azure Databricks – Bring Your Own VNET

March 20, 2019 by Abhinav Garg and Anna Shrestinian
Azure Databricks Unified Analytics Platform is the result of a joint product/engineering effort between Databricks and Microsoft. It’s available as a managed first-party...

A Guide to Data Science, Python, and Advanced Analytics Talks at Spark + AI Summit 2019

March 20, 2019 by Sophie Seddighzadeh
With a tsunami of data, scale of computing resources available, and rapid development of easy-to-learn open source Machine Learning frameworks, data science and...

A Guide to Developer, Deep Dive, and Continuous Streaming Applications Talks at Spark + AI Summit

February 19, 2019 by Jules Damji
In January 2013 when Stephen O’Grady, an analyst at RedMonk, published “The New Kingmakers: How Developers Conquered the World,” the book’s...

Near-Real-Time Hardware Failure Rate Estimation with Bayesian Reasoning

February 14, 2019 by Sean Owen
Try this notebook in Databricks. You might be using Bayesian techniques in your data science without knowing it! And if you're not, then...

New videos from Databricks Academy: Introduction to Machine Learning Series and the Apache Spark™ Cost-Based Optimizer

January 18, 2019 by Joshua Cook
Databricks' commitment to education is at the center of the work we do. Through Instructor-Led Training, Certification, and Self-Paced Training, Databricks Academy provides...

5 Reasons to Become an Apache Spark Expert

January 15, 2019 by Michael Ortega
Apache Spark™ has fast become the most popular unified analytics engine for big data and machine learning. It was originally developed at...

Apparate: Managing Libraries in Databricks with CI/CD

January 15, 2019 by Hanna Torrence
This is a guest blog from Hanna Torrence, Data Scientist at ShopRunner. Introduction: As leveraging data becomes a more vital component of organizations'...