Iceberg REST Catalog

At the very top of the Apache Iceberg architecture sits the Catalog layer. It is responsible for tracking the current state of a table by storing a pointer to the latest metadata file. Initially, organizations had to use specific catalog implementations like the Hive Metastore, AWS Glue, or direct database connections (JDBC) to manage these pointers. This created a problem: if a new query engine wanted to read Iceberg tables, it had to build custom integration code for every possible catalog backend.

The Iceberg REST Catalog API solves this problem. It is an open, vendor-neutral specification that standardizes how compute engines communicate with catalog services over HTTP. Instead of an engine needing to know the low-level details of how a specific catalog stores its data, the engine simply makes standardized REST API calls (like `GET /v1/namespaces/{namespace}/tables/{table}` to fetch table metadata, or `POST /v1/namespaces/{namespace}/tables/{table}` to commit an update).

Why the REST API Matters

The REST Catalog API is the linchpin of true vendor neutrality in the modern data lakehouse.

Eliminates Lock-in: An organization can deploy a REST-compliant catalog service (like Apache Polaris or Dremio Arctic) and know that any query engine supporting the REST standard (Spark, Flink, Trino, Snowflake, etc.) can instantly connect to it. If the organization later decides to migrate to a different catalog provider, they can do so without breaking their compute engines, provided the new catalog also implements the REST API.
Centralized Governance: By routing all table operations through a standardized REST interface, the catalog service becomes the natural place to enforce central governance. The catalog can intercept requests to verify authentication, enforce Role-Based Access Control (RBAC), and manage credential vending for the underlying object storage, ensuring security policies are applied consistently regardless of which engine submitted the query.
Future-Proofing: New capabilities (like Iceberg Spec v3 features or materialized view definitions) can be supported by updating the REST API specification, allowing the entire ecosystem of clients and servers to adopt new features coherently rather than relying on fragmented, implementation-specific updates.

Implementations

The REST API is just a specification. It requires a server to implement it. **Apache Polaris**, which graduated to a Top-Level Apache Project in early 2026, is the premier open-source implementation of the Iceberg REST Catalog API. Managed services from vendors like Snowflake and Dremio also expose REST-compliant endpoints, allowing organizations to choose between self-hosting an open-source solution or consuming a managed service, all while retaining the benefits of a standardized interface.