Events - 3/34 - Databricks

Events

Filter:

Automated Hyperparameter Tuning, Scaling and Tracking on Databricks

Webinar

In this talk, we'll start with a brief survey of the most popular techniques for hyperparameter tuning (e.g., grid search, random search, and Bayesian optimization). We will then discuss open source tools that implement each of these techniques, helping to automate the search over hyperparameters. Finally, we will discuss and demo improvements we built for these tools in Databricks, including integration with MLflow:

  • Apache PySpark MLlib integration with MLflow for automatically tracking tuning
  • Hyperopt integration with Apache Spark to distribute tuning and with MLflow for automatic tracking

Unified Analytics – Unifying Data Pipelines & Machine Learning with Apache Spark

Regional Event

Los Angeles, California

In this workshop, we’ll cover best practices for enterprises to use powerful open source technologies to simplify and scale your ML efforts. We’ll discuss how to leverage Apache Spark™️, the de-facto data processing and analytics engine in enterprises today, for data preparation as it unifies data at massive scale across various sources. You’ll learn how to use ML frameworks (i.e. Tensorflow, XGBoost, Scikit-Learn, etc.) to train models based on different requirements. And finally, you can learn how to use MLflow to track experiment runs between multiple users within a reproducible environment and manage the deployment of models to production.

AWS + Databricks Dev Day Workshop | New York City

Regional Event

New York City

In this workshop, we’ll cover best practices for enterprises to use powerful open source technologies to simplify and scale your ML efforts. We’ll discuss how to leverage Apache Spark™, the de-facto data processing and analytics engine in enterprises today, for data preparation as it unifies data at massive scale across various sources. You’ll also learn how to use ML frameworks (i.e. Tensorflow, XGBoost, Scikit-Learn, etc.) to train models based on different requirements. And finally, you can learn how to use MLflow to track experiment runs between multiple users within a reproducible environment, and manage the deployment of models to production on Amazon SageMaker.

Delta Lake Meetup: Bay Area Apache Spark Meetup @ Salesforce SF

Meetup

San Francisco

Join us for an evening of Bay Area Apache Spark Meetup featuring tech-talks about Apache Spark, Machine Learning, and Delta Lake at scale from Salesforce and Databricks.

From Raw Data to Predictive Models in Production with Databricks, Azure, and Talend

Partner Event

Europe

All business sectors face challenges driven by the integration of additional data sources, larger volumes and new scenarios such as personalisation and predictive analytics. Data Engineers and Data Scientists need smart, automated tools to deploy high-capacity pipelines. Microsoft Azure, Talend and Databricks have joined forces to meet these challenges. In this free workshop you'll learn how to: automatically build scalable pipelines without coding, spin up converge models, how to Integrate everything to speed up your projects.

Live Demo: Delta Lake

Live Demo

See how Delta Lake can help you build reliable data lakes at scale. Live demo by Databricks expert. Save your spot!

AWS + Databricks Dev Day Workshop | Toronto

Regional Event

Toronto

In this workshop, we’ll cover best practices for enterprises to use powerful open source technologies to simplify and scale your ML efforts. We’ll discuss how to leverage Apache Spark™, the de-facto data processing and analytics engine in enterprises today, for data preparation as it unifies data at massive scale across various sources. You’ll also learn how to use ML frameworks (i.e. Tensorflow, XGBoost, Scikit-Learn, etc.) to train models based on different requirements. And finally, you can learn how to use MLflow to track experiment runs between multiple users within a reproducible environment, and manage the deployment of models to production on Amazon SageMaker.

Tokyo Spark Meetup: Spark Meetup Tokyo # 1 (Spark + AI Summit 2019)

Meetup

Tokyo, Japan

Information on the latest development status of Spark presented at Spark + AI Summit 2019, use case reports from users, and related OSS such as Koalas / MLflow / Delta Lake.

Modernising your data warehouse in 24 hours

Webinar

Europe

Join Databricks, Datalytyx and Talend for this free live webinar on ‘Modernising your data warehouse in 24 hours’ to see how easy it is to get your data-focussed initiatives over the technical barriers and into reality. In this webinar you'll see how to: Quickly spin up cloud-based technology, enable business users to integrate, cleanse and master data, automatically scale storage and compute resources, deliver it all with no set up fee – just a monthly subscription cost with no lock-in

Unified Analytics | Genomics Hands-on lab

Regional Event

Cambridge, UK

In this workshop, we’ll walkthrough how the Databricks Unified Analytics Platform for Genomics simplifies the end-to-end process of turning raw sequencing data into actionable insights at scale. Introduced by the original creators of Apache Spark, this platform makes it simple to deploy Spark-based bioinformatics tools on cloud computing, and rapidly accelerates common genomic analyses. Join this half day technical workshop to learn how to: - Call variants, both in a single sample and across multiple samples, using our accelerated GATK4 pipelines - Use Spark SQL to characterise the association of variants in a population with phenotypes - Use machine learning to model genome-wide disease risk across multiple variants associated with a phenotype of interest