Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speech

Showing 221-240 of 368 projects

ckiplab/ckiptagger

A Python library for Chinese word segmentation, part-of-speech tagging, and named entity recognition.

1.7K
Experimental
Python
Natural Language Processing
Python
#natural-language-processing#word-segmentation#pos-tagging

alan-ai/alan-sdk-ionic

An SDK for adding an AI-powered chatbot and voice assistant to Ionic apps.

1.7K
Experimental
TypeScript
React
#ionic#chatbot#conversational-ai

szczyglis-dev/py-gpt

A Python-based desktop AI assistant that integrates with various LLMs and AI tools for coding, task automation, and more.

1.7K
Active
Python
LLM Frameworks
Agents & Orchestration
Python
#ai-assistant#llm#automation

neural-maze/ava-whatsapp-agent-course

A Python-based agent that uses speech recognition and text-to-speech to enable conversational interactions via WhatsApp.

1.6K
Stable
Python
Agents & Orchestration
AI Voice & Speech
#agent#stt#tts

absadiki/subsai

A Python-based tool for generating subtitles using OpenAI's Whisper speech recognition model.

1.6K
Stable
Python
LLM Wrappers & SDKs
API Frameworks
Python
#subtitles#speech-recognition#whisper

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection for offline use on multiple platforms.

1.6K
Stable
C++
AI Voice & Speech
Cross-Platform
#speech-recognition#voice-activity-detection#offline

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN repository with PyTorch implementation of various neural vocoders for speech synthesis.

1.6K
Archived
Jupyter Notebook
AI Voice & Speech
Backend Frameworks
PyTorch
#speech-synthesis#neural-vocoder#parallel-wavenet

GetStream/stream-chat-android

An open-source Android chat SDK that provides a full-featured chat experience for mobile apps, with support for Kotlin and Jetpack Compose.

1.6K
Active
Kotlin
Component Libraries (Android)
Chat & Messaging
Android
#android-chat#chat-sdk#kotlin-chat

iamsrikanthnani/pluely

An open-source AI assistant that works seamlessly in meetings, interviews, and conversations without detection.

1.6K
Active
TypeScript
AI Assistants
Desktop Apps
React
#ai-assistant#undetectable#privacy-first

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K
Active
Swift
AI Voice & Speech
iOS
Swift
#text-to-speech#speech-to-text#voice-activity-detection

OpenWhispr/openwhispr

A cross-platform voice-to-text app with local and cloud AI models and a privacy-first architecture.

1.6K
Active
TypeScript
Desktop Model Runners
AI Voice & Speech
Whisper
#voice-to-text#local-inference#whisper-api

Marak/say.js

A simple text-to-speech library for Node.js that allows developers to add voice output to their applications.

1.5K
Archived
JavaScript
API Frameworks
#tts#text-to-speech#audio

BytedanceSpeech/seed-tts-eval

This Python repository provides an evaluation framework for text-to-speech models.

1.5K
Archived
Python
AI Voice & Speech
Testing
Python
#text-to-speech#speech-synthesis#model-evaluation

Olcmyk/HuChenFeng

A collection of content from a Chinese internet personality, including livestream transcripts and text analysis.

1.5K
Stable
Speech-to-Text
Text Analysis
#archiving#content-analysis#livestream

steve228uk/MessengerKit

A UI framework for building messenger-style interfaces on iOS devices.

1.5K
Archived
Swift
Component Libraries (React)
iOS
Swift
#chat#messenger#ios

google/live-transcribe-speech-engine

Live Transcribe is an Android app that provides real-time captioning for people who are deaf or hard of hearing.

1.5K
Archived
Java
AI Voice & Speech
#accessibility#captioning#transcription

Hironsan/anago

A Python library for named-entity recognition, part-of-speech tagging, and other NLP tasks using LSTM-CRF and ELMo models.

1.5K
Archived
Python
NLP
API Frameworks
Keras
#named-entity-recognition#part-of-speech-tagging#sequence-labeling

antirez/voxtral.c

Pure C inference engine for Mistral Voxtral 4B speech-to-text model with minimal dependencies

1.5K
Active
C
Local Inference Engines
Inference
C
#speech-to-text#voxtral#c-inference

AlekPet/ComfyUI_Custom_Nodes_AlekPet

A collection of custom nodes that extend the capabilities of ComfyUI, a node-based interface for Stable Diffusion workflows.

1.5K
Active
JavaScript
AI Code Editors
LLM Frameworks
React
#comfyui#stable-diffusion#pose-detection

microsoft/NeuralSpeech

A library for speech synthesis and recognition using neural networks

1.5K
Archived
Python
Prompt Engineering
React
#speech-synthesis#neural-networks#prompt-engineering