Pangolin is an MIT licensed Open Source Lakehouse Catalog written in Rust. Designed for enthusiasts who want multi-table consistency, catalog federation, multi-tenancy, and business metadata catalog features in one catalog. Pangolin is currently in alpha and not ready for production. Contributions are welcome!
Passion project by Alex Merced.
Built with Rust for blazing fast API responses and low resource footprint.
Deploy anywhere. First-class support for AWS S3, Azure Blob Storage, and Google Cloud Storage.
Store your metadata in PostgreSQL, MongoDB, or SQLite. Choose what fits your stack.
Built-in Role-Based Access Control (RBAC) and Multi-Tenancy support out of the box.
Tag, document, and organize your data assets with validatable business metadata.
Experiment safely with Zero-Copy branching for your data catalogs.
Granular permission management using tags for easier governance at scale.
A beautiful, modern interface to manage your catalogs, users, and permissions.
Full control of your catalog from the terminal for automation and power users.
Easily find dataset across all your catalogs with robust search and filtering.
Securely vend temporary credentials for S3, Azure, and GCS to compute engines.
Fully compliant implementation of the Iceberg REST Catalog specification.
A clean, modern management UI for your Data Lakehouse.
Full-featured Python library for Pangolin with PyIceberg integration, multi-format support, and secure database connection management.
Now available on PyPI: pip install pypangolin
PyPangolin provides a comprehensive Python interface to all Pangolin features, plus specialized support for popular data formats and database connections.
View on PyPI → Documentation →Full PyIceberg integration with read/write operations
✅ TestedRead/write Delta tables with automatic registration
✅ TestedRegister and manage Hudi tables
SupportedRegister and track Paimon tables
SupportedRead/write Parquet files with metadata
✅ TestedRead/write structured files
✅ TestedRead/write vector database format
✅ TestedHigh-performance columnar format
SupportedSecurely store and manage database credentials with Fernet encryption
Encrypted credential storage
✅ TestedSecure connection sharing
✅ TestedNoSQL database connections
✅ TestedCloud data warehouse
⚠️ UntestedAWS data warehouse
⚠️ UntestedGoogle Cloud analytics
⚠️ UntestedMicrosoft analytics service
⚠️ UntestedArrow Flight connections
✅ TestedRBAC, permissions, service users, and business metadata
Audit logging, search, token management, and system config
Connect to remote Iceberg catalogs and create SQL views
Branching, merging, tagging with conflict resolution
The production-grade community-run lakehouse catalog. Best for large-scale open source deployments.
Visit Polaris →A managed version of Polaris with a built-in semantic layer, federated queries, and AI-powered autonomous optimization.
Try Dremio Cloud →Discuss Pangolin, Iceframe, and Dremioframe in the #pangolin-catalog channel on the Data Lakehouse Hub Slack.
Join Slack