An open-source library for data-centric AI with tools for data quality and machine learning on messy, real-world data.
Refine high-quality datasets and visual AI models with this Python library for active learning and data curation.
Miller is a powerful CLI tool for processing tabular data like CSV, TSV, and JSON, similar to awk, sed, and other Unix utilities.
LLM-based operators and pipelines for data preparation.
General Assembly's 2015 Data Science course covering topics like machine learning, data analysis, and data visualization.
Agile data preparation workflows made easy with popular Python data science libraries.
A collection of simple tools for data cleaning and wrangling in R for data science tasks.
A TypeScript toolkit for data transformation and analysis inspired by Pandas and LINQ.