Star Schema vs Snowflake Schema

Dimensional data modeling organizes data into fact tables (measuring business events like sales and clicks) and dimension tables (describing the context of those events: customers, products, dates). The arrangement of these tables relative to each other defines the schema shape, with two primary patterns: the Star Schema and the Snowflake Schema.

Star Schema

In a Star Schema, a central fact table (e.g., orders) directly references multiple flat dimension tables (dim_customer, dim_product, dim_date). The name comes from the star-like diagram when fact and dimension tables are connected. Key characteristics:

Dimension tables are denormalized: all customer attributes (name, city, country, segment) live in a single dim_customer table
Queries require fewer joins, since all attributes for a given dimension are in one table
Storage is slightly higher due to denormalization (redundant city/country values stored per customer)
Preferred for BI tools and OLAP queries where join minimization reduces query complexity

Snowflake Schema

A Snowflake Schema normalizes dimension tables by splitting them into multiple related tables. The dim_customer table might contain customer_id and address_id, with address details in a separate dim_address table, and country details in a dim_country table. This creates a "snowflake" branch structure:

Reduces storage by eliminating repeated values (country 'US' stored once, referenced by all US addresses)
Requires more joins to answer typical analytical queries
More complex SQL, harder for BI tools to auto-generate

The Modern Lakehouse Verdict

In modern columnar lakehouse environments with MPP engines, the storage savings of snowflake schema are negligible (Parquet's columnar compression already eliminates most redundancy). The join overhead, however, is measurable in distributed query execution. Most practitioners default to Star Schema for analytical layers, keeping snowflake-style normalization only at the raw ingestion layer where data contracts with source systems require normalized structure.

Star Schema

Snowflake Schema

The Modern Lakehouse Verdict

Master the Agentic Lakehouse

Start Your Free Dremio Trial

Architecting an Apache Iceberg Lakehouse

The AI Lakehouse

Star Schema vs Snowflake Schema

Star Schema

Snowflake Schema

The Modern Lakehouse Verdict

Related Articles

Master the Agentic Lakehouse

Start Your Free Dremio Trial

Architecting an Apache Iceberg Lakehouse

The AI Lakehouse