Explore Projects

Discover 382 open-source projects

Active filters (1):
Search: dataset

Showing 281-300 of 382 projects

kitops-ml/kitops

An open-source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI Artifact.

1.3K
Active
Go
MLOps
Containerization
Go
#ai #devops #oci

google-deepmind/rc-data

A question answering dataset for building AI-powered language models and conversational agents.

1.3K
Archived
Python
LLM Frameworks
Datasets
#question-answering #natural-language-processing #dataset

halaxa/json-machine

Efficient, easy-to-use, and fast PHP JSON stream parser for developers working with large JSON datasets.

1.3K
Stable
PHP
API Frameworks
Parsing
#json-parsing #stream-processing #php

wainshine/Company-Names-Corpus

A corpus of company names, abbreviations, and brands that can be used for Chinese text segmentation and entity recognition.

1.3K
Archived
Datasets
CLI Tools
#corpus #dataset #ner

ai-boost/awesome-ai-for-science

A curated list of awesome AI tools, libraries, papers, datasets, and frameworks for scientific discovery.

1.3K
Active
AI for Science
Awesome Lists
#ai-for-science #bioinformatics #awesome-list

konrad-gajdus/miniMNIST-c

A minimalist C implementation of a neural network for classifying handwritten digits from the MNIST dataset.

1.3K
Archived
C
ML Ops
CLI Tools
#mnist #machine-learning #c-language

jayleicn/animeGAN

A simple PyTorch implementation of Generative Adversarial Networks for generating anime-style faces.

1.3K
Archived
Jupyter Notebook
Computer Vision
Example Projects
PyTorch
#generative-adversarial-network #anime #computer-vision

streamlit/demo-self-driving

A Streamlit app that demonstrates real-time object detection on the Udacity self-driving-car dataset.

1.3K
Active
Python
Computer Vision
Charts & Visualization
Streamlit
#computer-vision #object-detection #yolo

jhc13/taggui

A Python-based tool for managing and captioning image datasets, with support for various AI models and frameworks.

1.3K
Stable
Python
Computer Vision
Component Libraries (React)
#image-tagging #image-captioning #llava

kyzhouhzau/BERT-NER

A Python library that uses Google's BERT for named entity recognition on the CoNLL-2003 dataset.

1.3K
Archived
Python
NER
API Frameworks
TensorFlow
#bert #conll-2003 #google-bert

datitran/raccoon_dataset

This GitHub repository contains a dataset for training a raccoon detector using TensorFlow.

1.3K
Archived
Jupyter Notebook
Computer Vision
Datasets
#tensorflow #computer-vision #dataset

YelpArchive/dataset-examples

Sample datasets for users of the Yelp Academic Dataset, useful for data analysis and machine learning.

1.3K
Archived
Python
Databases
Example Projects
#dataset #yelp #data-analysis

google-research/deduplicate-text-datasets

A Rust library for deduplicating text datasets, potentially useful for machine learning projects.

1.3K
Archived
Rust
Data & Databases
CLI Tools
#data-deduplication #text-processing #machine-learning

WillKoehrsen/machine-learning-project-walkthrough

A machine learning project walkthrough in Python demonstrating the end-to-end ML pipeline on a real-world dataset.

1.3K
Archived
Jupyter Notebook
ML Ops
#machine-learning #data-science #jupyter-notebook

kakaobrain/coyo-dataset

A large-scale image-text dataset for training AI models, primarily focused on visual AI and multimodal AI tasks.

1.3K
Archived
Python
Computer Vision
Agents & Orchestration
#computer-vision #multimodal-ai #dataset

Renumics/spotlight

Interactively explore unstructured datasets like audio, images, and video using this TypeScript library.

1.3K
Active
TypeScript
Computer Vision
Caching
React
#data-visualization #exploratory-data-analysis #unstructured-data

AtmaHou/Task-Oriented-Dialogue-Research-Progress-Survey

A comprehensive survey of task-oriented dialogue research, including datasets and state-of-the-art methods.

1.2K
Archived
LLM Frameworks
Tutorials & Courses
#task-oriented-dialogue #language-models #dataset-survey

utiasSTARS/pykitti

Python tools for working with the KITTI computer vision and robotics dataset.

1.2K
Archived
Python
Computer Vision
API Frameworks
Python
#computer-vision #kitti-dataset #robotics

facebookresearch/Replica-Dataset

Replica Dataset is a collection of high-quality 3D reconstructions of indoor spaces for training and testing machine learning models.

1.2K
Archived
C++
React
#machine learning #ai data #training dataset

manami-project/anime-offline-database

This repository provides a comprehensive JSON dataset containing metadata on anime series, movies, and cross-references to various anime sites.

1.2K
Active
Makefile
Databases
General Utilities
#anime #database #dataset
