Showing 61-80 of 84 projects
A neural network model for detecting different emotions from audio speeches using Python and deep learning.
An OBS plugin that enables real-time speech recognition and captioning using AI models like OpenAI Whisper.
SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.
A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.
A Linux app for speech-to-text, text-to-speech, and machine translation, with offline capabilities.
A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.
An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.
A speech recognition server based on Vosk and Kaldi libraries, supporting WebSocket, gRPC, and WebRTC protocols.
SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.
A Python command-line client for the Whisper speech-to-text model by OpenAI, using the CTranslate2 library.
Fine-tune and deploy the Whisper speech recognition model with accelerated inference and support for various platforms.
A voice chat app built with Python that leverages AI-powered speech recognition and text-to-speech capabilities.
The Alan AI SDK for Cordova provides a conversational AI interface for building voice-enabled apps.
A comprehensive collection of Transformer models for natural language processing tasks
Lhotse is a set of tools for handling multimodal data in machine learning projects, with a focus on speech and audio.
Offline Russian voice assistant with plugin-based skills for developers working with AI tools.
Unofficial PyTorch implementation of the Conformer model for speech recognition tasks.
A real-time speech recognition server built with the Kaldi toolkit and GStreamer framework.
AI-powered digital assistant that keeps your data private, with intelligent summaries and task tracking.
An open-source AI-powered virtual YouTuber (VTuber) platform built with Python for streaming on YouTube and Twitch.
Get weekly updates on trending AI coding tools and projects.