Explore Projects

Discover 8 open source projects

Active filters (1):
Search: data-cleaningร—
Clear all

Showing 1-8 of 8 projects

cleanlab/cleanlab

An open-source library for data-centric AI with tools for data quality and machine learning on messy, real-world data.

11.4K
Active
Python
Data Quality
Python
#data-centric-ai#data-quality#data-cleaning

voxel51/fiftyone

Refine high-quality datasets and visual AI models with this Python library for active learning and data curation.

10.4K
Active
Python
Computer Vision
Python
#active-learning#data-curation#data-quality

johnkerl/miller

Miller is a powerful CLI tool for processing tabular data like CSV, TSV, and JSON, similar to awk, sed, and other Unix utilities.

9.8K
Active
Go
CLI Tools
#csv#json#data-processing

OpenDCAI/DataFlow

LLMs-based Operators and Pipelines for data prep

2.9K
Active
Python
AI Coding Tools
Gradio
#data-science#data-agent#data-cleaning

justmarkham/DAT8

General Assembly's 2015 Data Science course covering topics like machine learning, data analysis, and data visualization.

1.6K
Archived
Jupyter Notebook
Tutorials & Courses
Jupyter Notebook
#data-analysis#data-science#machine-learning

hi-primus/optimus

Agile data preparation workflows made easy with popular Python data science libraries.

1.5K
Archived
Python
ETL & Pipelines
API Frameworks
#big-data-cleaning#data-analysis#data-cleaning

sfirke/janitor

A collection of simple tools for data cleaning and wrangling in R for data science tasks.

1.4K
Archived
R
Data Cleaning & Wrangling
#data-analysis#data-cleaning#data-science

data-forge/data-forge-ts

A TypeScript toolkit for data transformation and analysis inspired by Pandas and LINQ.

1.4K
Stable
TypeScript
Data Transformation & Analysis
Frontend Frameworks
React
#data-transformation#data-analysis#data-manipulation

Stay in the loop

Get weekly updates on trending AI coding tools and projects.