Explore Projects

Discover 84 open source projects

Active filters (1):

Search: speech-recognition×

Clear all

Showing 61-80 of 84 projects

MiteshPuthran/Speech-Emotion-Analyzer

A neural network model for detecting different emotions from audio speeches using Python and deep learning.

1.4K

Archived

Jupyter Notebook

Speech Recognition

Machine Learning

Jupyter Notebook

#speech-recognition#emotion-detection#neural-network

royshil/obs-localvocal

An OBS plugin that enables real-time speech recognition and captioning using AI models like OpenAI Whisper.

1.4K

Active

C++

AI Voice & Speech

Realtime

#live-streaming#realtime-transcription#speech-recognition

bytedance/SALMONN

SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.

1.4K

Stable

LLM Frameworks

Speech Recognition

#audio-processing#speech-recognition#video-understanding

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K

Archived

AI Voice & Speech

Databases

#speech-recognition#speech-synthesis#speech-processing

mkiol/dsnote

A Linux app for speech-to-text, text-to-speech, and machine translation, with offline capabilities.

1.4K

Active

C++

AI Voice & Speech

API Frameworks

#speech-recognition#speech-synthesis#machine-translation

sdkcarlos/artyom.js

A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.

1.3K

Archived

JavaScript

AI Voice & Speech

Frontend Frameworks

JavaScript

#speech-recognition#speech-synthesis#voice-commands

ictnlp/StreamSpeech

An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.

1.2K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#speech-recognition#speech-translation#speech-synthesis

alphacep/vosk-server

A speech recognition server based on Vosk and Kaldi libraries, supporting WebSocket, gRPC, and WebRTC protocols.

1.2K

Experimental

Python

AI Voice & Speech

BaaS Platforms

Python

#speech-recognition#asr#kaldi

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.

1.2K

Archived

Python

Audio & Speech

Signal Processing

PyTorch

#audio-processing#speech-recognition#signal-processing

Softcatala/whisper-ctranslate2

A Python command-line client for the Whisper speech-to-text model by OpenAI, using the CTranslate2 library.

1.2K

Stable

Python

LLM Wrappers & SDKs

API Frameworks

Python

#openai-whisper#speech-recognition#speech-to-text

yeyupiaoling/Whisper-Finetune

Fine-tune and deploy the Whisper speech recognition model with accelerated inference and support for various platforms.

1.2K

Stable

Fine-tuning

Inference

PyTorch

#speech-recognition#whisper#fine-tune

modal-labs/quillman

A voice chat app built with Python that leverages AI-powered speech recognition and text-to-speech capabilities.

1.2K

Experimental

Python

AI Voice & Speech

Serverless

#speech-recognition#text-to-speech#serverless

alan-ai/alan-sdk-cordova

The Alan AI SDK for Cordova provides a conversational AI interface for building voice-enabled apps.

1.1K

Experimental

Ruby

AI Voice & Speech

Component Libraries (React)

React

#chatbot#conversational-ai#speech-recognition

ashishpatel26/Treasure-of-Transformers

A comprehensive collection of Transformer models for natural language processing tasks

1.1K

Experimental

Jupyter Notebook

LLM Frameworks

Tutorials & Courses

PyTorch

#natural-language-processing#transformer-models#pretrained-models

lhotse-speech/lhotse

Lhotse is a set of tools for handling multimodal data in machine learning projects, with a focus on speech and audio.

1.1K

Active

Python

Speech & Voice

Data Pipelines

PyTorch

#speech-recognition#audio-processing#data-handling

janvarev/Irene-Voice-Assistant

Offline Russian voice assistant with plugin-based skills for developers working with AI tools.

1.1K

Active

Python

AI Voice & Speech

API Frameworks

Python

#speech-recognition#speech-synthesis#tts

sooftware/conformer

Unofficial PyTorch implementation of the Conformer model for speech recognition tasks.

1.1K

Active

Python

AI Voice & Speech

Backend Frameworks

PyTorch

#speech-recognition#convolution#transformer

alumae/kaldi-gstreamer-server

A real-time speech recognition server built with the Kaldi toolkit and GStreamer framework.

1.1K

Archived

Python

AI Voice & Speech

#speech-recognition#real-time#open-source

matthiasn/lotti

AI-powered digital assistant that keeps your data private, with intelligent summaries and task tracking.

1.1K

Active

Dart

AI Voice & Speech

Component Libraries (Flutter)

Flutter

#ai-assistant#private-data#speech-recognition

ardha27/AI-Waifu-Vtuber

An open-source AI-powered virtual YouTuber (VTuber) platform built with Python for streaming on YouTube and Twitch.

1.0K

Archived

Python

AI Voice & Speech

Animation & Motion

Python

#ai-vtuber#virtual-youtuber#streaming

1 2 35

Stay in the loop

Get weekly updates on trending AI coding tools and projects.