A comprehensive search tool for finding Chinese NLP datasets, with support for common English NLP datasets as well.
Easily convert large sets of image URLs into a dataset for AI/ML training and experimentation.
The Open Images dataset, a large-scale, diverse dataset of images that are annotated with object bounding boxes, visual relationships, and attributes.
A TypeScript-based tool for converting natural language queries into SQL using AI.
A Chinese name corpus and generator for natural language processing and entity recognition.
A full-featured CSV parser with a simple API and support for large datasets in Node.js.
CLUE is a comprehensive Chinese language understanding evaluation benchmark with datasets, baselines, pre-trained models, and a leaderboard.
A Jupyter Notebook project that helps developers label their own data and train custom AI models.
A database and paper collection for surface defect research, useful for developers building with AI tools.
TorchGeo is a Python library for working with geospatial data using PyTorch, providing datasets, samplers, transforms, and pre-trained models.
A list of satellite image training datasets with annotations for computer vision and deep learning.
A command-line tool for slicing and dicing log data in Rust, useful for developers who work with large datasets.
A synthetic data generator for text recognition, useful for training AI-powered text detection and OCR models.
This is a collection of classic and modern trojan builders, not a developer tool for AI-powered coding.
Deequ is a Scala library for defining "unit tests for data" to measure data quality in large datasets.
A PyTorch-powered library for loading and processing text data for natural language processing tasks.
A curated list of awesome JSON datasets that don't require authentication.
A TensorFlow-based example of human activity recognition using an LSTM RNN on smartphone sensor data.
A Python library that helps diagnose and curate datasets for data-centric AI applications.
A comprehensive index of medical imaging datasets for researchers and developers working in the medical imaging field.