Explore Projects

Discover 382 open source projects

Active filters (1):
Search: datasetsร—
Clear all

Showing 21-40 of 382 projects

zalandoresearch/fashion-mnist

A MNIST-like fashion product database for computer vision and deep learning benchmarking.

12.7K
Archived
Python
Computer Vision
Python
#benchmark#computer-vision#deep-learning

pwxcoo/chinese-xinhua

A comprehensive Chinese dictionary dataset for developers working on Chinese NLP projects.

11.5K
Archived
Python
JSON Dataset
Python
#chinese#chinese-nlp#data

cleanlab/cleanlab

An open-source library for data-centric AI with tools for data quality and machine learning on messy, real-world data.

11.4K
Active
Python
Data Quality
Python
#data-centric-ai#data-quality#data-cleaning

salesforce/LAVIS

LAVIS is a comprehensive library for multimodal deep learning, including image captioning, visual question answering, and more.

11.2K
Archived
Jupyter Notebook
Vision-Language Transformer
PyTorch
#deep-learning#multimodal-learning#vision-language

dataelement/bisheng

An open LLM devops platform for building next-gen enterprise AI applications with powerful features like GenAI workflow, RAG, Agent, and model management.

11.1K
Active
TypeScript
LLM Frameworks
React
#ai#llm#genai

simonw/datasette

An open-source multi-tool for exploring and publishing data, focused on simplifying data analysis and sharing.

10.8K
Active
Python
Databases
#data-analysis#data-exploration#data-publishing

facebookresearch/ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

10.6K
Archived
Python
LLM Frameworks
PyTorch
#ai#machine-learning#dialogue

doccano/doccano

An open-source annotation tool for machine learning practitioners to label data for training models.

10.6K
Active
Python
Data Annotation
Nuxt
#annotation-tool#data-labeling#machine-learning

voxel51/fiftyone

Refine high-quality datasets and visual AI models with this Python library for active learning and data curation.

10.4K
Active
Python
Computer Vision
Python
#active-learning#data-curation#data-quality

perspective-dev/perspective

An open-source data visualization and analytics component well-suited for large or streaming datasets.

10.4K
Active
C++
Charts & Visualization
JavaScript
#data-visualization#analytics#streaming

BrasilAPI/BrasilAPI

A comprehensive Brazilian API providing access to various public datasets and services.

10.3K
Stable
JavaScript
API Frameworks
Node
#brazil#public-datasets#api-development

timzhang642/3D-Machine-Learning

A comprehensive repository of resources for 3D machine learning, including papers, datasets, and frameworks.

10.1K
Archived
Computer Vision
#3d-machine-learning#point-cloud#mesh

mozilla/TTS

A deep learning library for text-to-speech applications, focusing on generating high-quality speech from text.

10.1K
Archived
Jupyter Notebook
AI Voice & Speech
PyTorch
#text-to-speech#speech-generation#deep-learning

satellite-image-deep-learning/techniques

A collection of techniques for deep learning with satellite and aerial imagery, including object detection and classification.

10.0K
Active
Computer Vision
PyTorch
#satellite-imagery#remote-sensing#object-detection

brightmart/nlp_chinese_corpus

Large-scale Chinese natural language processing corpus for training and fine-tuning language models

9.9K
Stable
LLM Frameworks
#chinese#nlp#corpus

activeloopai/deeplake

Versatile database for AI, supporting storage, querying, versioning, and visualization of any AI data.

9.0K
Active
C++
LLM Frameworks
Vector Databases
PyTorch
#ai#data-storage#vector-database

NirantK/awesome-project-ideas

Curated list of Machine Learning, NLP, Vision, and Recommender Systems project ideas for developers.

9.0K
Archived
Machine Learning
Tutorials & Courses
#machine-learning#nlp#computer-vision

RedditSota/state-of-the-art-result-for-machine-learning-problems

This repository provides the latest state-of-the-art results for a wide range of machine learning problems.

8.9K
Archived
ML Ops
#machine-learning#benchmarks#datasets

Arize-ai/phoenix

AI observability and evaluation tooling for developers building with large language models and AI agents.

8.8K
Active
Jupyter Notebook
LLM Frameworks
Agents & Orchestration
Jupyter Notebook
#ai-monitoring#ai-observability#llm-evaluation

hardikvasa/google-images-download

A Python script to download hundreds of images from Google Images.

8.7K
Archived
Python
React
#image-download#google-images#python-script
13...20

Stay in the loop

Get weekly updates on trending AI coding tools and projects.