CIOPages
DirectoryData & AnalyticsData Integration & ETLBruin

Bruin

Open SourceFunded

End-to-end open source data ingestion, transformation, and quality framework

Visit Website

About Bruin

Bruin is an open source enterprise-grade data framework designed to streamline data ingestion, transformation, and quality assurance across diverse platforms. It enables organizations to ingest data from over 70 sources, execute SQL, Python, and R transformations, and manage data quality with built-in checks, glossaries, and policies. Bruin supports modern lakehouse architectures including Iceberg and Delta Lake, facilitating scalable and incremental data workflows.

Targeted at data engineering and analytics teams within large enterprises, Bruin offers robust pipeline orchestration with concurrency, scheduling, and dry-run validation capabilities. It integrates with popular data warehouses and lakes such as AWS Athena, Snowflake, Google BigQuery, and Databricks, providing seamless connectivity and cross-platform data lineage visualization. The platform also supports secret management, developer tooling including a VS Code extension, and AI/LLM integration, making it a comprehensive solution for complex data environments.

Key Capabilities

  • Ingest data from 70+ diverse sources
  • Run SQL, Python, and R transformations
  • Support for lakehouse architectures with Iceberg and Delta Lake
  • Built-in data quality checks and policies
  • Visualize data lineage and compare tables across connections

Integrations

AWS AthenaSnowflakeGoogle BigQuery

This profile was compiled by CIOPages from public sources with AI assistance, and may be incomplete or out of date. It is informational only and not an endorsement. Represent this vendor? or .

Quick Facts

bruin-data.github.io/bruin
CategoryData & Analytics
SubcategoryData Integration & ETL
PricingOpen Source
DeploymentOpen Source, Cloud
Target SizeEnterprise