Skip to content

Metaxy + DuckDB

Metaxy implements DuckDBMetadataStore. It uses DuckDB as metadata storage and versioning engine.

Warning

DuckDB does not (currently) support concurrent writes. If multiple writers are a requirement (e.g. with distributed data processing), consider either using DuckLake with a PostgreSQL catalog, or refer to DuckDB's documentation to learn about implementing application-side work-arounds.

Tip

The Delta Lake metadata store might be a better alternative for concurrent writes.

Installation

pip install 'metaxy[duckdb]'

Extensions

DuckDB extensions can be loaded automatically:

store = DuckDBMetadataStore("metadata.db", extensions=["hashfuncs", "spatial"])

hashfuncs is typically used by the versioning engine.

Reference