The Kimball Methodology, developed by Ralph Kimball and documented in "The Data Warehouse Toolkit" (1996, with subsequent editions), is the most widely practiced approach to dimensional data modeling. Its core philosophy is that data warehouses (and by extension, lakehouses) should be designed around business processes and organized to be queryable by business users, not just technically optimized for storage efficiency.

Core Kimball Concepts

Kimball in Modern Lakehouses

Kimball methodology is directly applicable to Apache Iceberg lakehouses. Fact tables become large Iceberg tables partitioned by date (the most common query filter). Dimension tables are smaller Iceberg tables that load efficiently into memory. dbt (Data Build Tool) is the most popular tool for implementing Kimball models in modern lakehouses: dbt models transform Silver-layer Iceberg tables into Gold-layer Kimball fact and dimension tables using declarative SQL.

Master the Agentic Lakehouse

Architecting an Apache Iceberg Lakehouse

Architecting an Apache Iceberg Lakehouse

Buy on Manning
The AI Lakehouse

The AI Lakehouse

Buy on Amazon