Marquez
Open SourceFundedOpen source metadata platform for real-time data lineage and governance
About Marquez
Marquez is an open source metadata service designed to collect, store, and visualize data lineage and metadata across an enterprise's data ecosystem. It serves data engineering, analytics, and governance teams by providing a unified view of data dependencies, job executions, and dataset lineage. Marquez acts as a central source of truth for metadata, enabling organizations to monitor data quality, trace data provenance, and perform root cause analysis efficiently.
The platform offers a real-time metadata server compatible with the OpenLineage standard, supporting integrations with popular data orchestration and processing tools such as Apache Airflow, Apache Spark, and dbt. Its web-based interface presents a visual graph of complex data interdependencies, making it easier for CIOs and data leaders to understand data flows and ensure compliance with governance policies. Marquez’s lineage API facilitates automation of key workflows, enhancing data catalog enrichment and operational analytics across pipelines.
Key Capabilities
- ✓Real-time OpenLineage-compatible metadata server
- ✓Unified visual graph of data lineage and dependencies
- ✓Lineage API for automation and root cause analysis
- ✓Integration with Apache Airflow, Spark, Flink, dbt, Dagster
- ✓Centralized metadata storage for governance and monitoring
Integrations
Other Data Governance & Catalog Vendors
View allRelated Buyer Guides
Independent evaluation frameworks for this category.
This profile was compiled by CIOPages from public sources with AI assistance, and may be incomplete or out of date. It is informational only and not an endorsement. Represent this vendor? or .