Data modeling is the process of defining how data is organized, structured, and related within a database or lakehouse. The right modeling approach determines how easily different types of queries can be answered, how maintainable the schema is as business requirements evolve, and how efficiently storage is used. Modern lakehouses typically employ different modeling approaches at different layers of the data stack.

The Medallion Architecture Layers

Most Iceberg lakehouse teams organize their data in a medallion (multi-tier) architecture where modeling approach evolves from raw to refined:

Choosing the Right Model

Master the Agentic Lakehouse

Architecting an Apache Iceberg Lakehouse

Architecting an Apache Iceberg Lakehouse

Buy on Manning
The AI Lakehouse

The AI Lakehouse

Buy on Amazon