At the highest level of the Apache Iceberg architecture sits the Catalog. You can think of the Catalog as the "front desk" of your data lakehouse. When a query engine (like Spark, Flink, or Trino) wants to read from or write to an Iceberg table, it cannot simply scan an object storage bucket. It must first ask the Catalog where the table's current metadata is located.

The Role of the Catalog

The core responsibility of the Iceberg Catalog is to store a mapping between a logical table name (e.g., marketing.campaign_results) and the absolute URI of its most recent JSON Metadata File in object storage. The Catalog does not store the data files, nor does it store the bulk of the metadata (like manifest lists or manifest files). It simply stores the pointer to the root of the metadata tree.

Enabling ACID Transactions

Because object storage systems (like Amazon S3 or Google Cloud Storage) are designed for eventual consistency and do not natively support atomic file updates or multi-file transactions, Iceberg delegates the transaction locking mechanism to the Catalog layer.

When an engine finishes writing new data, it creates a new metadata JSON file. To finalize the commit, the engine asks the Catalog to swap the pointer from the old metadata file to the new one. The Catalog must provide an atomic Compare-and-Swap (CAS) operation. If the pointer has not changed since the engine started its transaction, the swap succeeds. If another engine changed the pointer first, the catalog rejects the commit, forcing the losing engine to retry. This is how Iceberg guarantees ACID compliance across concurrent writers.

Types of Catalogs

Iceberg's modular design means organizations can plug in different catalog implementations depending on their infrastructure:

Master the Agentic Lakehouse

Start building today with free trials and authoritative resources.

Architecting an Apache Iceberg Lakehouse

Architecting an Apache Iceberg Lakehouse

Buy on Manning
The AI Lakehouse

The AI Lakehouse

Buy on Amazon
Apache Iceberg and Agentic AI

Apache Iceberg and Agentic AI

Buy on Amazon
Lakehouse Built for Everyone

Lakehouse Built for Everyone

Buy on Amazon