Explore Projects

Discover 426 open source projects

Active filters (1):
Search: recognitionร—
Clear all

Showing 281-300 of 426 projects

cmusphinx/sphinx4

A pure Java speech recognition library that can be used in various applications.

1.4K
Archived
Java
AI Voice & Speech
#speech-recognition#natural-language-processing#audio-processing

alexsosn/iOS_ML

A curated list of Machine Learning, AI, and NLP solutions for iOS development.

1.4K
Archived
ML SDKs & Wrappers
iOS
Swift
#machine-learning#artificial-intelligence#computer-vision

microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

1.4K
Archived
Python
LLM Frameworks
AI Voice & Speech
Python
#speech-recognition#speech-synthesis#speech-text-pretraining

Ewenwan/Ros

An open-source robotics operating system (ROS) with support for speech recognition, semantic understanding, visual control, and Gazebo simulation.

1.4K
Archived
Makefile
Computer Vision
Realtime
#robotics#ros#computer-vision

m1guelpf/yt-whisper

Automatically generate YouTube subtitles using OpenAI's Whisper speech recognition model

1.4K
Archived
Python
LLM Wrappers & SDKs
Subtitles & Transcription
Python
#openai#whisper#subtitles

sc0ty/subsync

A C++ library for synchronizing subtitles with audio/video content using speech recognition.

1.4K
Archived
C++
API Frameworks
AI Voice & Speech
#speech-recognition#subtitle-synchronization#subtitle-processing

mdn/web-speech-api

Provides demos and examples for the Web Speech API, a powerful tool for adding speech recognition and synthesis to web apps.

1.4K
Archived
JavaScript
Frontend Frameworks
AI Voice & Speech
JavaScript
#speech-recognition#speech-synthesis#voice-interface

jakowenko/double-take

A unified UI and API for processing and training images for facial recognition across various AI tools.

1.4K
Stable
JavaScript
Computer Vision
API Development
Node.js
#facial-recognition#home-automation#mqtt

birdnet-team/BirdNET-Analyzer

A Python library for processing and analyzing scientific audio data, particularly for bird song detection and recognition.

1.4K
Stable
Python
Computer Vision
Caching
#bioacoustics#bird-song#deep-learning

hungtraan/FacebookBot

A Facebook Messenger Bot with voice recognition, NLP, and features like restaurant search and memo transcription.

1.4K
Archived
JavaScript
AI Voice & Speech
API Frameworks
Node
#voice-recognition#natural-language-processing#restaurant-search

DWCTOD/CVPR2024-Papers-with-Code-Demo

A curation of the latest CVPR (Computer Vision and Pattern Recognition) papers, code, and demos for AI-powered developers.

1.4K
Archived
Computer Vision
Tutorials & Courses
#computer-vision#cvpr#tutorials

seathiefwang/FaceRecognition-tensorflow

A TensorFlow-based face recognition neural network library for Python developers.

1.4K
Archived
Python
Computer Vision
TensorFlow
#computer-vision#face-recognition#neural-network

DragonComputer/Dragonfire

An open-source virtual assistant for Ubuntu-based Linux distributions, focused on speech recognition and natural language processing.

1.4K
Archived
Python
AI Voice & Speech
API Frameworks
#speech-recognition#natural-language-processing#ubuntu

MiteshPuthran/Speech-Emotion-Analyzer

A neural network model for detecting different emotions from audio speeches using Python and deep learning.

1.4K
Archived
Jupyter Notebook
Speech Recognition
Machine Learning
Jupyter Notebook
#speech-recognition#emotion-detection#neural-network

royshil/obs-localvocal

An OBS plugin that enables real-time speech recognition and captioning using AI models like OpenAI Whisper.

1.4K
Active
C++
AI Voice & Speech
Realtime
#live-streaming#realtime-transcription#speech-recognition

lightaime/cs231n

This repository contains solutions to assignments for the CS231n course on Convolutional Neural Networks for Visual Recognition.

1.4K
Archived
Jupyter Notebook
Computer Vision
Jupyter Notebook
#computer-vision#deep-learning#neural-networks

bytedance/SALMONN

SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.

1.4K
Stable
LLM Frameworks
Speech Recognition
#audio-processing#speech-recognition#video-understanding

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K
Archived
AI Voice & Speech
Databases
#speech-recognition#speech-synthesis#speech-processing

zhubenfu/License-Plate-Detect-Recognition-via-Deep-Neural-Networks-accuracy-up-to-99.9

A highly accurate deep learning-based license plate detection and recognition system for real-time use.

1.4K
Archived
C++
Computer Vision
#computer-vision#deep-learning#license-plate-recognition

mkiol/dsnote

A Linux app for speech-to-text, text-to-speech, and machine translation, with offline capabilities.

1.4K
Active
C++
AI Voice & Speech
API Frameworks
#speech-recognition#speech-synthesis#machine-translation
1...1416...22

Stay in the loop

Get weekly updates on trending AI coding tools and projects.