Explore Projects

Discover 426 open source projects

Search filter: recognition

Showing 281-300 of 426 projects

cmusphinx/sphinx4

A pure Java speech recognition library that can be used in various applications.

1.4K

Archived

Java

AI Voice & Speech

#speech-recognition #natural-language-processing #audio-processing

alexsosn/iOS_ML

A curated list of Machine Learning, AI, and NLP solutions for iOS development.

1.4K

Archived

ML SDKs & Wrappers

iOS

Swift

#machine-learning #artificial-intelligence #computer-vision

microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

1.4K

Archived

Python

LLM Frameworks

AI Voice & Speech

#speech-recognition #speech-synthesis #speech-text-pretraining

Ewenwan/Ros

An open-source robotics operating system (ROS) with support for speech recognition, semantic understanding, visual control, and Gazebo simulation.

1.4K

Archived

Makefile

Computer Vision

Realtime

#robotics #ros #computer-vision

m1guelpf/yt-whisper

Automatically generate YouTube subtitles using OpenAI's Whisper speech recognition model

1.4K

Archived

Python

LLM Wrappers & SDKs

Subtitles & Transcription

#openai #whisper #subtitles

sc0ty/subsync

A C++ library for synchronizing subtitles with audio/video content using speech recognition.

1.4K

Archived

C++

API Frameworks

AI Voice & Speech

#speech-recognition #subtitle-synchronization #subtitle-processing

mdn/web-speech-api

Demos and examples for the Web Speech API, which adds speech recognition and synthesis to web apps.

1.4K

Archived

JavaScript

Frontend Frameworks

AI Voice & Speech

#speech-recognition #speech-synthesis #voice-interface

jakowenko/double-take

A unified UI and API for processing and training images for facial recognition across various AI tools.

1.4K

Stable

JavaScript

Computer Vision

API Development

Node.js

#facial-recognition #home-automation #mqtt

birdnet-team/BirdNET-Analyzer

A Python library for processing and analyzing scientific audio data, particularly for bird song detection and recognition.

1.4K

Stable

Python

Computer Vision

Caching

#bioacoustics #bird-song #deep-learning

hungtraan/FacebookBot

A Facebook Messenger Bot with voice recognition, NLP, and features like restaurant search and memo transcription.

1.4K

Archived

JavaScript

AI Voice & Speech

API Frameworks

Node

#voice-recognition #natural-language-processing #restaurant-search

DWCTOD/CVPR2024-Papers-with-Code-Demo

A curated collection of recent CVPR (Computer Vision and Pattern Recognition) papers, code, and demos.

1.4K

Archived

Computer Vision

Tutorials & Courses

#computer-vision #cvpr #tutorials

seathiefwang/FaceRecognition-tensorflow

A TensorFlow-based face recognition neural network library for Python developers.

1.4K

Archived

Python

Computer Vision

TensorFlow

#computer-vision #face-recognition #neural-network

DragonComputer/Dragonfire

An open-source virtual assistant for Ubuntu-based Linux distributions, focused on speech recognition and natural language processing.

1.4K

Archived

Python

AI Voice & Speech

API Frameworks

#speech-recognition #natural-language-processing #ubuntu

MiteshPuthran/Speech-Emotion-Analyzer

A neural network model for detecting emotions in speech audio, built with Python and deep learning.

1.4K

Archived

Jupyter Notebook

Speech Recognition

Machine Learning

#speech-recognition #emotion-detection #neural-network

royshil/obs-localvocal

An OBS plugin that enables real-time speech recognition and captioning using AI models like OpenAI Whisper.

1.4K

Active

C++

AI Voice & Speech

Realtime

#live-streaming #realtime-transcription #speech-recognition

lightaime/cs231n

This repository contains solutions to assignments for the CS231n course on Convolutional Neural Networks for Visual Recognition.

1.4K

Archived

Jupyter Notebook

Computer Vision

#computer-vision #deep-learning #neural-networks

bytedance/SALMONN

SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.

1.4K

Stable

LLM Frameworks

Speech Recognition

#audio-processing #speech-recognition #video-understanding

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K

Archived

AI Voice & Speech

Databases

#speech-recognition #speech-synthesis #speech-processing

zhubenfu/License-Plate-Detect-Recognition-via-Deep-Neural-Networks-accuracy-up-to-99.9

A highly accurate deep learning-based license plate detection and recognition system for real-time use.

1.4K

Archived

C++

Computer Vision

#computer-vision #deep-learning #license-plate-recognition

mkiol/dsnote

A Linux app for speech-to-text, text-to-speech, and machine translation, with offline capabilities.

1.4K

Active

C++

AI Voice & Speech

API Frameworks

#speech-recognition #speech-synthesis #machine-translation
