Explore Projects

Discover 84 open source projects

Active filters (1):
Search: speech-recognitionร—
Clear all

Showing 61-80 of 84 projects

MiteshPuthran/Speech-Emotion-Analyzer

A neural network model for detecting different emotions from audio speeches using Python and deep learning.

1.4K
Archived
Jupyter Notebook
Speech Recognition
Machine Learning
Jupyter Notebook
#speech-recognition#emotion-detection#neural-network

royshil/obs-localvocal

An OBS plugin that enables real-time speech recognition and captioning using AI models like OpenAI Whisper.

1.4K
Active
C++
AI Voice & Speech
Realtime
#live-streaming#realtime-transcription#speech-recognition

bytedance/SALMONN

SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.

1.4K
Stable
LLM Frameworks
Speech Recognition
#audio-processing#speech-recognition#video-understanding

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K
Archived
AI Voice & Speech
Databases
#speech-recognition#speech-synthesis#speech-processing

mkiol/dsnote

A Linux app for speech-to-text, text-to-speech, and machine translation, with offline capabilities.

1.4K
Active
C++
AI Voice & Speech
API Frameworks
#speech-recognition#speech-synthesis#machine-translation

sdkcarlos/artyom.js

A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.

1.3K
Archived
JavaScript
AI Voice & Speech
Frontend Frameworks
JavaScript
#speech-recognition#speech-synthesis#voice-commands

ictnlp/StreamSpeech

An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.

1.2K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#speech-translation#speech-synthesis

alphacep/vosk-server

A speech recognition server based on Vosk and Kaldi libraries, supporting WebSocket, gRPC, and WebRTC protocols.

1.2K
Experimental
Python
AI Voice & Speech
BaaS Platforms
Python
#speech-recognition#asr#kaldi

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.

1.2K
Archived
Python
Audio & Speech
Signal Processing
PyTorch
#audio-processing#speech-recognition#signal-processing

Softcatala/whisper-ctranslate2

A Python command-line client for the Whisper speech-to-text model by OpenAI, using the CTranslate2 library.

1.2K
Stable
Python
LLM Wrappers & SDKs
API Frameworks
Python
#openai-whisper#speech-recognition#speech-to-text

yeyupiaoling/Whisper-Finetune

Fine-tune and deploy the Whisper speech recognition model with accelerated inference and support for various platforms.

1.2K
Stable
C
Fine-tuning
Inference
PyTorch
#speech-recognition#whisper#fine-tune

modal-labs/quillman

A voice chat app built with Python that leverages AI-powered speech recognition and text-to-speech capabilities.

1.2K
Experimental
Python
AI Voice & Speech
Serverless
#speech-recognition#text-to-speech#serverless

alan-ai/alan-sdk-cordova

The Alan AI SDK for Cordova provides a conversational AI interface for building voice-enabled apps.

1.1K
Experimental
Ruby
AI Voice & Speech
Component Libraries (React)
React
#chatbot#conversational-ai#speech-recognition

ashishpatel26/Treasure-of-Transformers

A comprehensive collection of Transformer models for natural language processing tasks

1.1K
Experimental
Jupyter Notebook
LLM Frameworks
Tutorials & Courses
PyTorch
#natural-language-processing#transformer-models#pretrained-models

lhotse-speech/lhotse

Lhotse is a set of tools for handling multimodal data in machine learning projects, with a focus on speech and audio.

1.1K
Active
Python
Speech & Voice
Data Pipelines
PyTorch
#speech-recognition#audio-processing#data-handling

janvarev/Irene-Voice-Assistant

Offline Russian voice assistant with plugin-based skills for developers working with AI tools.

1.1K
Active
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#speech-synthesis#tts

sooftware/conformer

Unofficial PyTorch implementation of the Conformer model for speech recognition tasks.

1.1K
Active
Python
AI Voice & Speech
Backend Frameworks
PyTorch
#speech-recognition#convolution#transformer

alumae/kaldi-gstreamer-server

A real-time speech recognition server built with the Kaldi toolkit and GStreamer framework.

1.1K
Archived
Python
AI Voice & Speech
#speech-recognition#real-time#open-source

matthiasn/lotti

AI-powered digital assistant that keeps your data private, with intelligent summaries and task tracking.

1.1K
Active
Dart
AI Voice & Speech
Component Libraries (Flutter)
Flutter
#ai-assistant#private-data#speech-recognition

ardha27/AI-Waifu-Vtuber

An open-source AI-powered virtual YouTuber (VTuber) platform built with Python for streaming on YouTube and Twitch.

1.0K
Archived
Python
AI Voice & Speech
Animation & Motion
Python
#ai-vtuber#virtual-youtuber#streaming

Stay in the loop

Get weekly updates on trending AI coding tools and projects.