Explore Projects

Discover 36 open-source projects

Active filters (1):

Search: vits

Showing 1-20 of 36 projects

labmlai/annotated_deep_learning_paper_implementations

Deep learning paper implementations with side-by-side notes and explanations

65.9K

Active

Python

Fine-tuning

Computer Vision

PyTorch

#deep-learning #pytorch #transformers

RVC-Boss/GPT-SoVITS

Few-shot voice cloning and TTS with 1 minute of training data

55.5K

Active

Python

AI Voice & Speech

#tts #voice-clone #few-shot

huggingface/pytorch-image-models

A collection of PyTorch image encoders/backbones with training, evaluation, and inference scripts.

36.4K

Active

Python

LLM Frameworks

Full-Stack Frameworks

Next.js

#PyTorch #Image Models #Deep Learning

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Voice conversion framework with web UI for training and real-time voice models

34.7K

Archived

Python

AI Voice & Speech

#voice-conversion #audio-processing #real-time

svc-develop-team/so-vits-svc

Singing Voice Conversion framework using AI

28.0K

Archived

Python

AI Voice & Speech

PyTorch

#ai #audio-analysis #deep-learning

fishaudio/fish-speech

FishAudio-S1 is a high-quality open-source TTS model with voice cloning capabilities.

25.1K

Active

Python

AI Voice & Speech

#tts #voice-cloning #ai-speech

lukas-blecher/LaTeX-OCR

A deep learning model that converts images of mathematical equations into LaTeX code.

16.2K

Archived

Python

Computer Vision

PyTorch

#ocr #latex #math

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K

Active

C++

AI Voice & Speech

#speech-to-text #text-to-speech #offline

OpenGVLab/InternVL

An open-source, large language model-based multimodal dialogue system that achieves near-GPT-4o performance.

9.9K

Stable

Python

LLM Frameworks

#gpt #llm #multimodal

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K

Experimental

Python

AI Audio & Speech

#audio-generation #speech-synthesis #text-to-speech

voicepaw/so-vits-svc-fork

A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.

9.3K

Active

Python

AI Voice & Speech

PyTorch

#voice-conversion #speech-synthesis #realtime

fishaudio/Bert-VITS2

Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.

8.7K

Active

Python

LLM Frameworks

React

#LLM #TTS #VITS2

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K

Archived

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech #deep-learning #speech-synthesis

open-compass/VLMEvalKit

Open-source toolkit for evaluating large multi-modal AI models, supporting 220+ models and 80+ benchmarks.

3.9K

Active

Python

LLM Frameworks

LLM Wrappers & SDKs

PyTorch

#chatgpt #llm #multi-modal

innnky/so-vits-svc

VITS-based audio colorization model for music

3.8K

Archived

Python

AI Editors/Agents/Copilot

#Audio Colorization Model #VITS #SoftVC

towhee-io/towhee

A fast and simple framework for building neural data processing pipelines using Python.

3.5K

Archived

Python

LLM Frameworks

Computer Vision

#machine-learning #computer-vision #embeddings

thu-ml/SageAttention

Quantized attention that achieves 2-5x speedup over FlashAttention for language, image, and video models.

3.2K

Active

Cuda

Inference

Quantization

PyTorch

#attention #efficient-attention #inference-acceleration

IAHispano/Applio

A high-quality voice conversion tool focused on ease of use and performance for AI-powered audio applications.

3.0K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-to-speech #text-to-speech #voice-conversion

CjangCjengh/MoeGoe

Executable file for VITS inference, a neural text-to-speech model for generating high-quality speech.

2.4K

Archived

Python

LLM Frameworks

Inference

#text-to-speech #neural-network #inference

roboflow/inference

Turn any computer or edge device into a command center for your computer vision projects.

2.2K

Active

Python

Computer Vision

API Frameworks

Docker

#computer-vision #inference #object-detection
