Explore Projects

Discover 8 open source projects

Active filters (1):
Search: data-profilingร—
Clear all

Showing 1-8 of 8 projects

Data-Centric-AI-Community/ydata-profiling

A Python library for fast, customizable, and interactive data profiling and exploratory data analysis.

13.4K
Active
Python
Data Profiling
Python
#data-profiling#exploratory-data-analysis#data-quality

cleanlab/cleanlab

An open-source library for data-centric AI with tools for data quality and machine learning on messy, real-world data.

11.4K
Active
Python
Data Quality
Python
#data-centric-ai#data-quality#data-cleaning

great-expectations/great_expectations

A Python library that helps ensure data quality and reliability through data profiling and testing.

11.2K
Active
Python
ETL & Pipelines
#data-quality#data-testing#data-profiling

open-metadata/OpenMetadata

A unified metadata platform for data discovery, data observability, and data governance.

8.8K
Active
TypeScript
Data Catalog
Data Governance
TypeScript
#data-discovery#data-lineage#data-quality

hi-primus/optimus

Agile data preparation workflows made easy with popular Python data science libraries.

1.5K
Archived
Python
ETL & Pipelines
API Frameworks
#big-data-cleaning#data-analysis#data-cleaning

opendatadiscovery/odd-platform

First open-source data discovery and observability platform for data practitioners.

1.4K
Active
Java
Data Discovery
Data Observability
#data-catalog#data-engineering#data-governance

cleanlab/cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

1.2K
Active
Python
Computer Vision
Data Exploration
Python
#computer-vision#data-quality#data-profiling

rstudio/pointblank

Data quality assessment and reporting tool for data frames and database tables in R

1.0K
Active
R
Data Validation
Testing
#data-quality#data-validation#data-testing

Stay in the loop

Get weekly updates on trending AI coding tools and projects.