Explore Projects

Discover 368 open source projects

Showing 1-20 of 368 projects matching the search "speech"

huggingface/transformers

Model-definition framework for state-of-the-art ML models across text, vision, audio, and multimodal tasks.

157.4K

Active

Python

LLM Frameworks

Agents & Orchestration

PyTorch

#transformers #huggingface #deep-learning

openai/whisper

Robust speech recognition model for multilingual tasks

95.5K

Stable

Python

AI Voice & Speech

PyTorch

#speech-recognition #multilingual #audio-processing

fighting41love/funNLP

Comprehensive Chinese NLP resource collection for developers

79.2K

Archived

Python

LLM Frameworks

RAG & Vector

Python

#nlp #chinese-nlp #ai-resources

CorentinJ/Real-Time-Voice-Cloning

Real-time voice cloning using deep learning

59.5K

Stable

Python

AI Voice & Speech

CLI Tools

PyTorch

#voice-cloning #tts #deep-learning

RVC-Boss/GPT-SoVITS

Few-shot voice cloning and TTS with 1 minute of training data

55.5K

Active

Python

AI Voice & Speech

#tts #voice-clone #few-shot

unslothai/unsloth

Fine-tuning & RL for LLMs with optimized performance and memory use

53.4K

Active

Python

Fine-tuning

#llm #fine-tuning #reinforcement-learning

ggml-org/whisper.cpp

High-performance C/C++ port of OpenAI's Whisper for speech recognition

47.2K

Active

C++

Inference

CLI Tools

#speech-to-text #c++ #inference

coqui-ai/TTS

🐸TTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.

44.7K

Archived

Python

AI Voice & Speech

PyTorch

#text-to-speech #deep-learning #speech-synthesis

suno-ai/bark

Text-to-audio model for generating realistic speech and sounds

39.0K

Archived

Jupyter Notebook

AI Voice & Speech

#text-to-speech #audio-generation #ai-model

2noise/ChatTTS

Generates natural speech for dialogue scenarios

38.9K

Active

Python

AI Voice & Speech

Python

#tts #speech-synthesis #ai-voice

babysor/MockingBird

Voice cloning tool for real-time speech generation

36.9K

Stable

Python

AI Voice & Speech

PyTorch

#voice-cloning #tts #speech-synthesis

myshell-ai/OpenVoice

Instant voice cloning model with tone color cloning and multi-lingual support

36.0K

Experimental

Python

AI Voice & Speech

SaaS Boilerplates

Python

#voice-clone #text-to-speech #zero-shot-tts

svc-develop-team/so-vits-svc

Singing Voice Conversion framework using AI

28.0K

Archived

Python

AI Voice & Speech

PyTorch

#ai #audio-analysis #deep-learning

mozilla/DeepSpeech

Offline speech-to-text engine for real-time on-device use

26.7K

Experimental

C++

AI Voice & Speech

Documentation

TensorFlow

#speech-to-text #deep-learning #tensorflow

fishaudio/fish-speech

FishAudio-S1 is a high-quality open-source TTS model with voice cloning capabilities.

25.1K

Active

Python

AI Voice & Speech

#tts #voice-cloning #ai-speech

OpenBMB/MiniCPM-o

On-device multimodal LLM for vision, speech, and live streaming on phones

24.0K

Active

Python

Inference

Local Inference Engines

llama.cpp-omni

#minicpm-o #multimodal-llm #on-device-ai

microsoft/VibeVoice

Open-source voice AI models for speech synthesis and recognition

23.6K

Active

Python

AI Voice & Speech

#voice-ai #speech-synthesis #speech-recognition

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K

Stable

Python

Inference

Local Inference Engines

CTranslate2

#speech-to-text #inference #quantization

huggingface/datasets

AI-powered dataset management and preprocessing library for ML projects

21.2K

Active

Python

ML Ops

ETL & Pipelines

HuggingFace

#datasets #ml-ops #data-preprocessing

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K

Active

Python

AI Voice & Speech

Python

#asr #speech-to-text #diarization

