Showing 1-20 of 140 projects
Model framework for state-of-the-art ML models in text, vision, audio, and multimodal tasks.
YOLOv5 is a state-of-the-art computer vision model for object detection, segmentation, and classification.
State-of-the-art diffusion models for image, audio, and video generation in PyTorch.
Object detection toolbox for PyTorch with support for multiple tasks and state-of-the-art models.
2D/3D face analysis with AI
State-of-the-art open-source TTS models for high-quality voice generation
AudioCraft is a PyTorch library for audio generation with deep learning models like MusicGen and AudioGen.
Tracks progress in NLP tasks with datasets and benchmarks
Parameter-efficient fine-tuning for large models
State-of-the-art text embedding library for building advanced natural language processing applications.
Run state-of-the-art ๐ค Transformers AI models directly in the browser, without a server.
A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.
A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.
A simple, state-of-the-art NLP framework for tasks like named entity recognition and semantic role labeling.
An implementation of the YOLOv7 state-of-the-art real-time object detection model.
TensorRT LLM provides a Python API and optimizations to efficiently run large language models on NVIDIA GPUs.
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
Foundational models for state-of-the-art speech and text translation
A lightweight, state-of-the-art text-to-speech (TTS) model for developers building AI-powered applications.
StyleGAN2 is an official TensorFlow implementation of a state-of-the-art generative adversarial network.
Get weekly updates on trending AI coding tools and projects.