Delta Lake is an open source storage layer that brings reliability to data lakes. It has numerous reliability features including ACID transactions, scalable metadata handling, and unified streaming and batch data processing. It also offers DML commands to update, delete, and merge data for your data lifecycle, such as for GDPR/CCPA. Delta Lake runs on top of your existing data lake, such as on Azure Data Lake Storage, AWS S3, Hadoop HDFS, or on-premise, and is fully compatible with Apache Spark APIs.
Join Databricks and Microsoft to learn how to leverage Azure best practices for implementing a complete data science lifecycle, enabling data teams to scale effectively using Azure Databricks, MLflow and other Azure services. In this workshop, you will learn how to train models and create predictions with Azure Databricks. You'll also learn how to track experiments and tune hyperparameters with MLflow. Finally, we'll walk through how to deploy and serve models with MLflow and other Azure services.
In this virtual workshop, we’ll cover best practices for enterprises to use powerful open source technologies to simplify and scale your data and ML efforts. We’ll discuss how to leverage Apache Spark™, the de-facto data processing and analytics engine in enterprises today, for data preparation as it unifies data at massive scale across various sources. You’ll learn how to use ML frameworks (i.e. TensorFlow, XGBoost, Scikit-Learn, etc.) to train models based on different requirements. And finally, you can learn how to use MLflow to track experiment runs between multiple users within a reproducible environment, and manage the deployment of models to production.
In this workshop, we’ll discuss how organizations that successfully embedded ESG at the core of their business have built the operational resilience required to better adapt to emerging threats. We’ll demonstrate a novel approach to supply chain analytics by combining geospatial techniques and predictive analytics to reduce the carbon footprint, improve working conditions and enhance regulatory compliance.
In this virtual workshop, we’ll cover best practices for organizations to use powerful open source technologies to build and extend your AWS investments to make your data lake analytics ready. You’ll learn about the advantages of cloud-based data lakes in terms of security and cost. And finally, you’ll learn how data professionals are having a huge impact - lowering costs, changing time to market, and even revolutionizing industries.
In this virtual workshop, we’ll cover best practices for enterprises to use powerful open source technologies to simplify and scale your Data and ML efforts. We’ll discuss how to leverage Apache Spark™, the de-facto data processing and analytics engine in enterprises today, for data preparation as it unifies data at massive scale across various sources. You’ll also learn how to use Data and ML frameworks (i.e. TensorFlow, XGBoost, Scikit-Learn, etc.) to train models based on different requirements.
This virtual conference will highlight to data teams the tangible results that can be achieved, and best-of-breed practices to scale the impact of Data & Analytics across revenue generation, customer retention, service satisfaction and cost efficiency levers. Join this virtual event, which is proudly supported by Databricks.
Join this session for a one-hour deep-dive on how companies can apply advanced analytics to geospatial datasets and deliver on a broad range of use cases like mining exploration, oil discovery, asset inspection, flood surveys, environment protection, facility management, transportation planning, fraud detection, and more.
This 3 hour tutorial is suited for SQL analysts and developers who want to get hands-on experience and a deeper knowledge of the benefits of using SQL on the Databricks Unified Data Analytics Platform. Domain knowledge and familiarity of SQL is required for this session.
Join Databricks and Microsoft as we share how you can easily query your data lake using SQL and Delta Lake on Azure. We’ll show how Delta Lake enables you to run SQL queries without moving or copying your data. We will also explain some of the added benefits that Azure Databricks provides when working with Delta Lake.