Explore Projects

Discover 7 open source projects

Active filters (1):
Search: data-curationร—
Clear all

Showing 1-7 of 7 projects

cleanlab/cleanlab

An open-source library for data-centric AI with tools for data quality and machine learning on messy, real-world data.

11.4K
Active
Python
Data Quality
Python
#data-centric-ai#data-quality#data-cleaning

voxel51/fiftyone

Refine high-quality datasets and visual AI models with this Python library for active learning and data curation.

10.4K
Active
Python
Computer Vision
Python
#active-learning#data-curation#data-quality

Docta-ai/docta

A Python library that helps diagnose and curate datasets for data-centric AI applications.

3.5K
Archived
Python
LLM Frameworks
Caching
#data-curation#data-diagnosis#language-model

visual-layer/fastdup

Accelerate data curation and augmentation with this scalable, free tool for image and video analysis.

1.8K
Stable
Python
Computer Vision
ETL & Pipelines
Python
#data-augmentation#data-curation#image-processing

NVIDIA-NeMo/Curator

Scalable data pre processing and curation toolkit for Large Language Models (LLMs)

1.4K
Active
Python
Python
#data-curation#large-language-models#data-preparation

Renumics/spotlight

Interactively explore unstructured datasets like audio, images, and video using this TypeScript library.

1.3K
Active
TypeScript
Computer Vision
Caching
React
#data-visualization#exploratory-data-analysis#unstructured-data

daochenzha/data-centric-AI

A curated list of resources for data-centric AI development, including tools, frameworks, and best practices.

1.1K
Archived
LLM Frameworks
Databases
#data-centric-ai#machine-learning#data-science

Stay in the loop

Get weekly updates on trending AI coding tools and projects.