Explore Projects

Discover 368 open source projects

Search filter: speech

Showing 1-20 of 368 projects

huggingface/transformers

Model-definition framework for state-of-the-art ML models across text, vision, audio, and multimodal tasks.

157.4K
Active
Python
LLM Frameworks
Agents & Orchestration
PyTorch
#transformers#huggingface#deep-learning

openai/whisper

Robust speech recognition model for multilingual tasks

95.5K
Stable
Python
AI Voice & Speech
PyTorch
#speech-recognition#multilingual#audio-processing

fighting41love/funNLP

Comprehensive Chinese NLP resource collection for developers

79.2K
Archived
Python
LLM Frameworks
RAG & Vector
#nlp#chinese-nlp#ai-resources

CorentinJ/Real-Time-Voice-Cloning

Real-time voice cloning using deep learning

59.5K
Stable
Python
AI Voice & Speech
CLI Tools
PyTorch
#voice-cloning#tts#deep-learning

RVC-Boss/GPT-SoVITS

Few-shot voice cloning and TTS with 1 minute of training data

55.5K
Active
Python
AI Voice & Speech
#tts#voice-clone#few-shot

unslothai/unsloth

Fine-tuning & RL for LLMs with optimized performance and memory use

53.4K
Active
Python
Fine-tuning
#llm#fine-tuning#reinforcement-learning

ggml-org/whisper.cpp

High-performance C/C++ port of OpenAI's Whisper for speech recognition

47.2K
Active
C++
Inference
CLI Tools
#speech-to-text#c++#inference

coqui-ai/TTS

🐸TTS is a deep learning toolkit for advanced Text-to-Speech generation, with pretrained models in 1100+ languages and tools for training and fine-tuning models.

44.7K
Archived
Python
AI Voice & Speech
PyTorch
#text-to-speech#deep-learning#speech-synthesis

suno-ai/bark

Text-to-audio model for generating realistic speech and sounds

39.0K
Archived
Jupyter Notebook
AI Voice & Speech
#text-to-speech#audio-generation#ai-model

2noise/ChatTTS

Generates natural speech for dialogue scenarios

38.9K
Active
Python
AI Voice & Speech
#tts#speech-synthesis#ai-voice

babysor/MockingBird

Voice cloning tool for real-time speech generation

36.9K
Stable
Python
AI Voice & Speech
PyTorch
#voice-cloning#tts#speech-synthesis

myshell-ai/OpenVoice

Instant voice cloning model with tone color cloning and multi-lingual support

36.0K
Experimental
Python
AI Voice & Speech
SaaS Boilerplates
#voice-clone#text-to-speech#zero-shot-tts

svc-develop-team/so-vits-svc

Singing Voice Conversion framework using AI

28.0K
Archived
Python
AI Voice & Speech
PyTorch
#ai#audio-analysis#deep-learning

mozilla/DeepSpeech

Offline speech-to-text engine for real-time on-device use

26.7K
Experimental
C++
AI Voice & Speech
Documentation
TensorFlow
#speech-to-text#deep-learning#tensorflow

fishaudio/fish-speech

FishAudio-S1 is a high-quality open-source TTS model with voice cloning capabilities.

25.1K
Active
Python
AI Voice & Speech
#tts#voice-cloning#ai-speech

OpenBMB/MiniCPM-o

On-device multimodal LLM for vision, speech, and live streaming on phones

24.0K
Active
Python
Inference
Local Inference Engines
llama.cpp-omni
#minicpm-o#multimodal-llm#on-device-ai

microsoft/VibeVoice

Open-source voice AI models for speech synthesis and recognition

23.6K
Active
Python
AI Voice & Speech
#voice-ai#speech-synthesis#speech-recognition

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K
Stable
Python
Inference
Local Inference Engines
CTranslate2
#speech-to-text#inference#quantization

huggingface/datasets

Dataset loading, management, and preprocessing library for ML projects

21.2K
Active
Python
ML Ops
ETL & Pipelines
HuggingFace
#datasets#ml-ops#data-preprocessing

m-bain/whisperX

Fast ASR with word-level timestamps and speaker diarization

20.5K
Active
Python
AI Voice & Speech
#asr#speech-to-text#diarization