The Data Lakehouse 101
Learn the architecture that combines the flexibility of data lakes with the performance and governance of data warehouses.
What Is a Data Lakehouse?
The data lakehouse is a paradigm that uses a data lake as a data warehouse by creating open tables using table formats such as Apache Iceberg. These tables provide data-warehouse-style performance, reliability, and governance while remaining compatible with the broader data ecosystem. This approach enables ACID guarantees, open interoperability, and flexible analytics across engines and tools.
Learn More
Data Lakehouse Podcasts
Essential Data Lakehouse Books
Lakehouse Communities
Data Lakehouse Hub
Dremio Developer Lakehouse and AI Community
📚 The Data Lakehouse Knowledge Base
100 definitive guides on every data lakehouse term — from Apache Iceberg internals to Dremio architecture, ETL patterns, and agentic AI.