CIOPages
DirectoryData & AnalyticsData Governance & CatalogKylo

Kylo

Open SourceFunded

Open-source data lake management with governance and self-service ingestion

Visit Website

About Kylo

Kylo is an open-source enterprise-ready platform designed to simplify data lake management by enabling self-service data ingestion, preparation, and governance. It empowers data owners and analysts to ingest, cleanse, validate, and transform data through an intuitive UI, reducing reliance on engineering teams. Kylo integrates metadata management, security, and data quality best practices, helping organizations maintain trust and control over their data assets.

Built on Apache NiFi and Apache Spark, Kylo supports batch and streaming data pipelines with customizable templates, enabling IT teams to extend capabilities while maintaining governance. Its integrated metadata repository facilitates data discovery, lineage visualization, and profiling, ensuring users can find and trust data within the lake. Kylo also provides feed-centric monitoring to track SLAs and troubleshoot data pipeline health, making it suitable for large enterprises seeking scalable, governed data lake solutions.

Key Capabilities

  • Self-service data ingestion with validation and profiling
  • Visual data preparation using interactive SQL transformations
  • Integrated metadata catalog with search and lineage
  • Feed-centric monitoring with SLA tracking
  • Pipeline template design with Apache NiFi integration

Integrations

Apache NiFiApache SparkHive

This profile was compiled by CIOPages from public sources with AI assistance, and may be incomplete or out of date. It is informational only and not an endorsement. Represent this vendor? or .

Quick Facts

kylo.io
CategoryData & Analytics
SubcategoryData Governance & Catalog
PricingOpen Source
DeploymentOpen Source, On-Premises
Target SizeEnterprise