Explore Projects

Discover 382 open source projects

Active filters (1):
Search: datasets

Showing 161-180 of 382 projects

apple/ml-hypersim

A photorealistic synthetic dataset for holistic indoor scene understanding using machine learning.

2.0K
Active
Python
Computer Vision
#computer-vision #machine-learning #dataset

eosphoros-ai/DB-GPT-Hub

A repository providing models, datasets, and fine-tuning techniques for DB-GPT to enhance Text-to-SQL performance.

2.0K
Experimental
Python
React
#authentication #fine-tuning #text-to-sql

apple/ml-cvnets

A computer vision library for training and deploying deep learning models, with support for popular datasets and tasks.

2.0K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#computer-vision #deep-learning #image-classification

alias-rahil/handwritten.js

A JavaScript library that converts typed text into realistic handwriting with customizable fonts and styles.

2.0K
Archived
JavaScript
Animation & Motion
JavaScript
#handwriting #text-to-image #font-customization

youngfish42/Awesome-FL

Comprehensive collection of federated learning resources (papers, frameworks, datasets, tutorials, etc.)

2.0K
Stable
Python
LLM Frameworks
Databases
#federated-learning #machine-learning #artificial-intelligence

WuJie1010/Facial-Expression-Recognition.Pytorch

A state-of-the-art PyTorch implementation for facial expression recognition, useful for developers working on computer vision and AI applications.

1.9K
Archived
Python
Computer Vision
Inference
PyTorch
#facial-expression-recognition #computer-vision #pytorch

google-deepmind/mathematics_dataset

Generates mathematical question-and-answer pairs at school-level difficulty, useful for AI/ML research.

1.9K
Archived
Python
LLM Frameworks
Coding Challenges
#mathematics #dataset #machine-learning

thu-coai/CDial-GPT

Large-scale Chinese Short-Text Conversation Dataset and pre-training dialog models for AI-powered developers

1.9K
Archived
Python
PyTorch
#dialogue #text-generation #gpt2

shramos/Awesome-Cybersecurity-Datasets

A curated list of cybersecurity datasets for security researchers and machine learning practitioners.

1.9K
Archived
Security Research
Datasets
#cybersecurity #dataset #security-research

youngguncho/awesome-slam-datasets

A curated list of awesome datasets for Simultaneous Localization and Mapping (SLAM) research and development.

1.9K
Archived
library
#slam #computer-vision #robotics

Guang000/Awesome-Dataset-Distillation

A curated list of awesome papers on dataset distillation and related AI/ML applications.

1.9K
Active
HTML
Machine Learning Ops
Tutorials & Courses
React
#dataset-distillation #machine-learning #ai

diffgram/diffgram

An AI datastore for managing schemas, BLOBs, and predictions to build AI-powered applications.

1.9K
Archived
Python
Data Annotation
Databases
Python
#data-annotation #data-management #machine-learning

apache/kudu

Apache Kudu is a high-performance, open-source columnar storage engine for large datasets in the Apache Hadoop ecosystem.

1.9K
Active
C++
Databases
API Frameworks
#big-data #cplusplus #open-source

AutoViML/AutoViz

Automatically visualize a dataset of any size with a single line of code, built for vibe coders.

1.9K
Archived
Python
Visualization
Data Visualization
Python
#data-visualization #machine-learning #automated-ml

h2oai/datatable

A high-performance, memory-efficient Python data analysis library for handling large datasets.

1.9K
Experimental
C++
Databases
CLI Tools
Python
#data-analysis #performance #memory-efficient

uber/petastorm

Petastorm enables training and evaluation of deep learning models from Apache Parquet datasets.

1.9K
Active
Python
ML Ops
Databases
PyTorch
#deep-learning #machine-learning #data-processing

starik222/BooruDatasetTagManager

A C# tool for managing tags in Booru-style image datasets, useful for AI/ML developers working with visual data.

1.9K
Stable
C#
Computer Vision
Databases
#computer-vision #image-datasets #tagging

wq2012/awesome-diarization

A curated list of resources for speaker diarization, a speech processing task to identify who spoke when.

1.9K
Experimental
Speech Processing
Awesome Lists
#speaker-diarization #speech-recognition #machine-learning

visual-layer/fastdup

Accelerate data curation and augmentation with this scalable, free tool for image and video analysis.

1.8K
Stable
Python
Computer Vision
ETL & Pipelines
Python
#data-augmentation #data-curation #image-processing

MIT-SPARK/Kimera-VIO

A C++ library for visual-inertial odometry and simultaneous localization and mapping (SLAM) with 3D mesh generation.

1.8K
Experimental
C++
Computer Vision
API Frameworks
#localization #mapping #robotics