Provides access to, and documentation for, the Quick, Draw! dataset, a large collection of doodles used for machine learning research.
A repository that collects, organizes, and publishes Chinese natural language processing (NLP) datasets to advance the development of Chinese NLP.
A curated list of free/public domain text datasets for natural language processing (NLP) tasks.
A repository for preparing large datasets for training large language models (LLMs).
A PyTorch repository for practicing image classification on the CIFAR-100 dataset using various deep learning models.
A comprehensive search tool for finding Chinese NLP datasets that also covers common English NLP datasets.
CLUE is a comprehensive Chinese language understanding evaluation benchmark with datasets, baselines, pre-trained models, and a leaderboard.
A PyTorch-powered library for loading and processing text data for natural language processing tasks.
Waymo Open Dataset is a large-scale dataset for autonomous driving research and development.
A comprehensive collection of papers and datasets for 3D point cloud processing, useful for developers working on autonomous driving and computer vision.
A large collection of system log datasets for AI-driven log analytics.
A dataset for music analysis and research, with support for deep learning and reproducible research.
A curated list of medical AI/ML resources, including LLMs, datasets, and benchmarks.
A diverse and well-annotated dataset for license plate detection and recognition.
A curated collection of open-source Chinese medical NLP resources including datasets, models, and more.
A comprehensive Python library for color science and color space conversions.
CodeSearchNet provides datasets, tools, and benchmarks for representation learning of code, enabling AI-powered code discovery.
Starter code for working with the YouTube-8M dataset, a large-scale video understanding dataset.
A dataset of annotated 3D object videos for training computer vision and augmented reality models.
A video foundation model and dataset for multimodal and video understanding tasks.