LAKEHOUSE STORAGE

Built for open, intelligent data storage

Choose your storage location and format, with full ownership and portability of your data.


BENEFITS

Lakehouse storage that’s flexible and fast

Eliminate data management headaches with open table formats, centralized governance and automatic data optimizations.

Compatible formats

A single copy of source data in Delta Lake or Apache Iceberg™ that can be accessed by any engine.

Unified governance

A single catalog for data discovery and governance, across your data and AI assets.

AI-driven performance

AI-powered models autonomously optimize and maintain data for speed and low cost.

FEATURES

Your data, your way

Choose the storage location and open format that work for you. Keep your data portable, without vendor lock-in.

Best-in-class read and write performance for Delta Lake and Apache Iceberg™ tables, out of the box, with storage optimizations not available in any other lakehouse.

More about managed tables

Access tables managed by external catalogs such as AWS Glue, Hive Metastore (HMS) and Snowflake Horizon, and apply advanced Unity Catalog features like fine-grained access controls to them.

More about foreign tables

The Unity REST and Iceberg REST Catalog APIs unlock the entire lakehouse ecosystem, across formats and engines.

More about using external systems

More features

ACID Transactions

Atomicity, consistency, isolation and durability guarantees provided by open table format protocols.
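To make the guarantee concrete, here is a minimal sketch of the idea behind these protocols: an ordered commit log in which a writer may only add version N+1 if the table is still at the version N it read. Everything in it (`ToyTableLog`, `CommitConflict`, the `("add", ...)` and `("remove", ...)` actions) is a hypothetical in-memory model for illustration, not the Delta Lake or Apache Iceberg™ implementation.

```python
class CommitConflict(Exception):
    """Raised when another writer already claimed the target version."""


class ToyTableLog:
    """Hypothetical in-memory stand-in for a table's ordered commit log."""

    def __init__(self):
        self._commits = []  # index i holds the actions committed as version i

    @property
    def latest_version(self):
        return len(self._commits) - 1  # -1 means nothing has been committed

    def commit(self, read_version, actions):
        # Optimistic concurrency: succeed only if no other writer has
        # committed since this writer read the table at read_version.
        if read_version != self.latest_version:
            raise CommitConflict(
                f"read version {read_version}, but log is at {self.latest_version}"
            )
        self._commits.append(list(actions))  # all actions land together
        return self.latest_version

    def snapshot(self):
        # Readers replay committed actions in order, so they never observe
        # a partially applied write.
        rows = set()
        for actions in self._commits:
            for op, row in actions:
                if op == "add":
                    rows.add(row)
                elif op == "remove":
                    rows.discard(row)
        return rows


log = ToyTableLog()
log.commit(log.latest_version, [("add", "file-a"), ("add", "file-b")])  # version 0

# Two writers both read version 0, then race to commit version 1.
winner_version = log.commit(0, [("remove", "file-a")])
try:
    log.commit(0, [("add", "file-c")])
    loser_committed = True
except CommitConflict:
    loser_committed = False  # the loser must re-read the table and retry
```

The losing writer's actions never reach the log, so a reader sees either the full commit or none of it.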


Predictive Optimization

AI-driven table optimizations based on your data and usage patterns that keep your tables tuned, automatically.


Liquid Clustering

Out-of-the-box, self-tuning data layout that scales with your data, with no partitions required.


Change Data Feed

Track row-level changes between versions of a Delta table.
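As a rough illustration of what row-level changes between two versions look like, the sketch below compares two snapshots keyed by a primary key. The change-type labels mirror the ones Delta Lake's change data feed reports (`insert`, `delete`, `update_preimage`, `update_postimage`). The function `row_changes` and the sample data are hypothetical, and diffing snapshots is a simplification: the real feature records changes as writes happen.

```python
def row_changes(old, new):
    """Return (change_type, key, value) tuples between two keyed snapshots.

    Hypothetical helper: `old` and `new` map a primary key to a row value.
    """
    changes = []
    for key in sorted(old.keys() | new.keys()):
        if key not in old:
            changes.append(("insert", key, new[key]))
        elif key not in new:
            changes.append(("delete", key, old[key]))
        elif old[key] != new[key]:
            # An update is reported as the row before and after the change.
            changes.append(("update_preimage", key, old[key]))
            changes.append(("update_postimage", key, new[key]))
    return changes


version_1 = {1: "alice", 2: "bob", 3: "carol"}
version_2 = {1: "alice", 2: "robert", 4: "dave"}

changes = row_changes(version_1, version_2)
for change in changes:
    print(change)
```

Downstream consumers can apply only these changed rows instead of reprocessing the whole table.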


Time Travel

Historical information about tables lets you audit operations, roll back a table or query a table at a specific point in time.
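The three operations named here (audit, roll back, query at a point in time) can be sketched with a small versioned table. `ToyVersionedTable` and its method names are hypothetical, and it stores a full copy of the rows per version for simplicity, which a real table format does not need to do.

```python
from bisect import bisect_right


class ToyVersionedTable:
    """Hypothetical in-memory table that keeps every committed version."""

    def __init__(self):
        self._timestamps = []  # commit time of each version, ascending
        self._snapshots = []   # rows as of each version

    def write(self, rows, timestamp):
        if self._timestamps and timestamp < self._timestamps[-1]:
            raise ValueError("commit timestamps must not go backwards")
        self._timestamps.append(timestamp)
        self._snapshots.append(tuple(rows))
        return len(self._snapshots) - 1  # the new version number

    def history(self):
        # Audit trail: (version, commit timestamp) pairs.
        return list(enumerate(self._timestamps))

    def as_of_version(self, version):
        if not 0 <= version < len(self._snapshots):
            raise IndexError(f"no such version: {version}")
        return list(self._snapshots[version])

    def as_of_timestamp(self, timestamp):
        # Latest version committed at or before the requested time.
        index = bisect_right(self._timestamps, timestamp) - 1
        if index < 0:
            raise LookupError("no version exists at or before that timestamp")
        return list(self._snapshots[index])

    def restore(self, version, timestamp):
        # A rollback is recorded as a new commit, so history stays append-only.
        return self.write(self.as_of_version(version), timestamp)


table = ToyVersionedTable()
table.write(["a"], timestamp=100)        # version 0
table.write(["a", "b"], timestamp=200)   # version 1
table.write(["b"], timestamp=300)        # version 2

rows_at_v1 = table.as_of_version(1)
rows_at_t250 = table.as_of_timestamp(250)
restored_version = table.restore(0, timestamp=400)
```

Because the rollback is itself a commit, the versions it replaced remain queryable afterwards.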


Structured Streaming

Integration with Apache Spark™ Structured Streaming, a near real-time processing engine that offers end-to-end fault tolerance with exactly-once processing guarantees.
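One common way to get exactly-once results is to pair a checkpoint of stream progress with a sink that ignores batches it has already committed. The sketch below simulates a crash between the write and the checkpoint update to show why the replayed batch is not duplicated. `ToyIdempotentSink`, `run_micro_batches` and the `crash_after_write` parameter are hypothetical names in a toy model, not Apache Spark™ APIs.

```python
class ToyIdempotentSink:
    """Hypothetical sink that remembers the last batch id it committed."""

    def __init__(self):
        self.rows = []
        self.last_batch_id = -1

    def write_batch(self, batch_id, rows):
        if batch_id <= self.last_batch_id:
            return False  # replay after a failure: already committed, skip
        # A real table would commit the rows and the batch id atomically.
        self.rows.extend(rows)
        self.last_batch_id = batch_id
        return True


def run_micro_batches(source, sink, checkpoint, batch_size, crash_after_write=None):
    """Process `source` in fixed-size batches, resuming from `checkpoint`."""
    while True:
        batch_id = checkpoint["next_batch"]
        start = batch_id * batch_size
        rows = source[start:start + batch_size]
        if not rows:
            return
        sink.write_batch(batch_id, rows)
        if batch_id == crash_after_write:
            # Simulated failure: data is written, progress is not yet saved.
            raise RuntimeError("crashed before checkpoint update")
        checkpoint["next_batch"] = batch_id + 1


source = list(range(6))
sink = ToyIdempotentSink()
checkpoint = {"next_batch": 0}

try:
    run_micro_batches(source, sink, checkpoint, batch_size=2, crash_after_write=1)
except RuntimeError:
    pass  # the job dies after writing batch 1

# Restart: batch 1 is replayed from the checkpoint, but the sink skips it.
run_micro_batches(source, sink, checkpoint, batch_size=2)
```

After the restart, each source row appears in the sink once, even though batch 1 was attempted twice.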


USE CASES

For all your analytics and AI workloads

Run analytics and BI workloads directly on your data lake

Delta Lake and Apache Iceberg™ let you operate a multicloud lakehouse architecture that delivers data warehouse performance at data lake economics, with up to 6x better price/performance for SQL workloads than traditional cloud data warehouses.
