Explore Projects

Discover 36 open source projects

Active filters (1):
Search: vitsร—
Clear all

Showing 1-20 of 36 projects

labmlai/annotated_deep_learning_paper_implementations

Deep learning paper implementations with side-by-side notes and explanations

65.9K
Active
Python
Fine-tuning
Computer Vision
PyTorch
#deep-learning#pytorch#transformers

RVC-Boss/GPT-SoVITS

Few-shot voice cloning and TTS with 1 min training data

55.5K
Active
Python
AI Voice & Speech
#tts#voice-clone#few-shot

huggingface/pytorch-image-models

A collection of PyTorch image encoders/backbones with training, evaluation, and inference scripts.

36.4K
Active
Python
LLM Frameworks
Full-Stack Frameworks
Next.js
#PyTorch#Image Models#Deep Learning

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Voice conversion framework with web UI for training and real-time voice models

34.7K
Archived
Python
AI Voice & Speech
Python
#voice-conversion#audio-processing#real-time

svc-develop-team/so-vits-svc

Singing Voice Conversion framework using AI

28.0K
Archived
Python
AI Voice & Speech
PyTorch
#ai#audio-analysis#deep-learning

fishaudio/fish-speech

FishAudio-S1 is a high-quality open-source TTS model with voice cloning capabilities.

25.1K
Active
Python
AI Voice & Speech
#tts#voice-cloning#ai-speech

lukas-blecher/LaTeX-OCR

A deep learning model that converts images of mathematical equations into LaTeX code.

16.2K
Archived
Python
Computer Vision
PyTorch
#ocr#latex#math

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K
Active
C++
AI Voice & Speech
#speech-to-text#text-to-speech#offline

OpenGVLab/InternVL

An open-source, large language model-based multimodal dialogue system that achieves near-GPT-4o performance.

9.9K
Stable
Python
LLM Frameworks
Python
#gpt#llm#multimodal

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K
Experimental
Python
AI Audio & Speech
Python
#audio-generation#speech-synthesis#text-to-speech

voicepaw/so-vits-svc-fork

A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.

9.3K
Active
Python
AI Voice & Speech
PyTorch
#voice-conversion#speech-synthesis#realtime

fishaudio/Bert-VITS2

Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.

8.7K
Active
Python
LLM Frameworks
React
#LLM#TTS#VITS2

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K
Archived
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#deep-learning#speech-synthesis

open-compass/VLMEvalKit

Open-source toolkit for evaluating large multi-modal AI models, supporting 220+ models and 80+ benchmarks.

3.9K
Active
Python
LLM Frameworks
LLM Wrappers & SDKs
PyTorch
#chatgpt#llm#multi-modal

innnky/so-vits-svc

Vits-based audio colorization model for music

3.8K
Archived
Python
AI Editors/Agents/Copilot
#Audio Colorization Model#VITS#SoftVC

towhee-io/towhee

A fast and simple framework for building neural data processing pipelines using Python.

3.5K
Archived
Python
LLM Frameworks
Computer Vision
Python
#machine-learning#computer-vision#embeddings

thu-ml/SageAttention

Quantized attention that achieves 2-5x speedup over FlashAttention for language, image, and video models.

3.2K
Active
Cuda
Inference
Quantization
PyTorch
#attention#efficient-attention#inference-acceleration

IAHispano/Applio

A high-quality voice conversion tool focused on ease of use and performance for AI-powered audio applications.

3.0K
Active
Python
AI Voice & Speech
API Frameworks
PyTorch
#speech-to-speech#text-to-speech#voice-conversion

CjangCjengh/MoeGoe

Executable file for VITS inference, a neural text-to-speech model for generating high-quality speech.

2.4K
Archived
Python
LLM Frameworks
Inference
Python
#text-to-speech#neural-network#inference

roboflow/inference

Turn any computer or edge device into a command center for your computer vision projects.

2.2K
Active
Python
Computer Vision
API Frameworks
Docker
#computer-vision#inference#object-detection
2

Stay in the loop

Get weekly updates on trending AI coding tools and projects.