CIOPages

Marquez

Open Source, Funded

Open source metadata platform for real-time data lineage and governance


About Marquez

Marquez is an open source metadata service designed to collect, store, and visualize data lineage and metadata across an enterprise's data ecosystem. It serves data engineering, analytics, and governance teams by providing a unified view of data dependencies, job executions, and dataset lineage. Marquez acts as a central source of truth for metadata, enabling organizations to monitor data quality, trace data provenance, and perform root cause analysis efficiently.

The platform offers a real-time metadata server compatible with the OpenLineage standard, supporting integrations with popular data orchestration and processing tools such as Apache Airflow, Apache Spark, and dbt. Its web-based interface presents a visual graph of complex data interdependencies, making it easier for CIOs and data leaders to understand data flows and ensure compliance with governance policies. Marquez’s lineage API lets teams automate key workflows, such as data catalog enrichment and operational analytics across pipelines.
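As a rough illustration of what "OpenLineage-compatible" means in practice, the sketch below builds a minimal OpenLineage run event and posts it to a Marquez server. It assumes a local instance at http://localhost:5000 that accepts events at /api/v1/lineage. The namespace, job name, dataset names, and producer URL are hypothetical, and the schemaURL version should be checked against the OpenLineage spec in use.

```python
import json
import uuid
from datetime import datetime, timezone
from urllib import request

# Assumption: a Marquez instance running locally on its usual API port.
MARQUEZ_URL = "http://localhost:5000"


def build_run_event(event_type, run_id, job_name, inputs, outputs, namespace="example"):
    """Build a minimal OpenLineage run event as a plain dict."""

    def dataset(name):
        return {"namespace": namespace, "name": name}

    return {
        "eventType": event_type,  # e.g. "START" or "COMPLETE"
        "eventTime": datetime.now(timezone.utc).isoformat(),
        "run": {"runId": run_id},
        "job": {"namespace": namespace, "name": job_name},
        "inputs": [dataset(n) for n in inputs],
        "outputs": [dataset(n) for n in outputs],
        # Hypothetical producer identifier for this sketch.
        "producer": "https://example.com/lineage-sketch",
        # Assumption: spec version; verify against the OpenLineage release you target.
        "schemaURL": "https://openlineage.io/spec/1-0-5/OpenLineage.json#/definitions/RunEvent",
    }


def send_event(event, base_url=MARQUEZ_URL):
    """POST the event to the lineage endpoint and return the HTTP status code."""
    req = request.Request(
        f"{base_url}/api/v1/lineage",
        data=json.dumps(event).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req) as resp:
        return resp.status


if __name__ == "__main__":
    event = build_run_event(
        "COMPLETE",
        str(uuid.uuid4()),
        "daily_orders_rollup",       # hypothetical job
        ["raw.orders"],              # hypothetical input dataset
        ["analytics.orders_daily"],  # hypothetical output dataset
    )
    # Printing only; call send_event(event) when a server is actually reachable.
    print(json.dumps(event, indent=2))
```

In most deployments the Airflow, Spark, or dbt integrations emit these events automatically, so hand-built events like this are mainly useful for custom jobs or for testing.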

Key Capabilities

  • Real-time OpenLineage-compatible metadata server
  • Unified visual graph of data lineage and dependencies
  • Lineage API for automation and root cause analysis
  • Integrations with Apache Airflow, Apache Spark, Apache Flink, dbt, and Dagster
  • Centralized metadata storage for governance and monitoring
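The lineage API and root cause analysis capabilities listed above might be combined roughly as follows. This sketch builds a lineage query URL and walks a response graph to list everything upstream of a dataset. It assumes the lineage endpoint is /api/v1/lineage with nodeId and depth query parameters, and that each returned node carries an "id" and a list of "inEdges" with "origin" fields. The sample graph is hypothetical data, not real server output.

```python
from collections import deque
from urllib.parse import urlencode


def lineage_url(base_url, node_id, depth=20):
    """Build the lineage query URL for a node such as 'dataset:<namespace>:<name>'."""
    return f"{base_url}/api/v1/lineage?" + urlencode({"nodeId": node_id, "depth": depth})


def upstream_nodes(graph, start_id):
    """Return ids of all nodes upstream of start_id, in breadth-first order."""
    by_id = {node["id"]: node for node in graph}
    seen = []
    queue = deque([start_id])
    while queue:
        current = queue.popleft()
        for edge in by_id.get(current, {}).get("inEdges", []):
            origin = edge["origin"]
            if origin != start_id and origin not in seen:
                seen.append(origin)
                queue.append(origin)
    return seen


# Hypothetical graph in the assumed response shape: raw.orders -> job -> orders_daily.
SAMPLE_GRAPH = [
    {"id": "dataset:example:raw.orders", "inEdges": []},
    {
        "id": "job:example:daily_orders_rollup",
        "inEdges": [{"origin": "dataset:example:raw.orders",
                     "destination": "job:example:daily_orders_rollup"}],
    },
    {
        "id": "dataset:example:analytics.orders_daily",
        "inEdges": [{"origin": "job:example:daily_orders_rollup",
                     "destination": "dataset:example:analytics.orders_daily"}],
    },
]

if __name__ == "__main__":
    target = "dataset:example:analytics.orders_daily"
    print(lineage_url("http://localhost:5000", target, depth=2))
    print(upstream_nodes(SAMPLE_GRAPH, target))
```

When a downstream dataset looks wrong, the upstream list gives the jobs and source datasets to inspect first, which is the basic idea behind lineage-driven root cause analysis.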

Integrations

Apache Airflow, Apache Spark, dbt

This profile was compiled by CIOPages from public sources with AI assistance, and may be incomplete or out of date. It is informational only and not an endorsement.

Quick Facts

marquezproject.ai
Category: Data & Analytics
Subcategory: Data Governance & Catalog
Pricing: Open Source
Deployment: Open Source
Target Size: Enterprise