Explore Projects

Discover 426 open source projects

Active filters (1):
Search: recognitionร—
Clear all

Showing 121-140 of 426 projects

guillaume-chevalier/LSTM-Human-Activity-Recognition

A TensorFlow-based example of human activity recognition using an LSTM RNN on smartphone sensor data.

3.5K
Archived
Jupyter Notebook
Computer Vision
Tutorials & Courses
TensorFlow
#activity-recognition#deep-learning#lstm

aim-uofa/AdelaiDet

AdelaiDet is an open-source toolbox for instance-level detection and recognition tasks, including object detection and text recognition.

3.5K
Archived
Python
Computer Vision
API Frameworks
Python
#object-detection#instance-segmentation#text-detection

MaaXYZ/MaaFramework

An automation black-box testing framework based on image recognition for developers.

3.4K
Active
C++
API Frameworks
Computer Vision
#black-box-testing#computer-vision#automation

xenova/whisper-web

A TypeScript-powered web app that brings ML-powered speech recognition to the browser using the Whisper AI model.

3.3K
Archived
TypeScript
AI Voice & Speech
Frontend Frameworks
React
#speech-recognition#ai-models#browser-based

hankcs/pyhanlp

An open-source Chinese NLP library providing state-of-the-art tools for word segmentation, dependency parsing, named entity recognition, and more.

3.2K
Archived
Python
NLP
Python
#chinese-nlp#word-segmentation#dependency-parsing

kerlomz/captcha_trainer

This open-source Python library is a training platform for building deep learning models to recognize captchas.

3.2K
Stable
Python
Computer Vision
API Frameworks
TensorFlow
#captcha-recognition#ocr#deep-learning

ahmetoner/whisper-asr-webservice

A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.

3.2K
Stable
Python
AI Voice & Speech
API Clients & Testing
Flask
#speech-recognition#automatic-speech-recognition#openai-whisper

stepfun-ai/Step-Video-T2V

A Python library for converting video files to text transcripts using AI-powered speech recognition.

3.2K
Experimental
Python
AI Video & Speech
None
#speech-recognition#text-transcription#video-to-text

deepdoctection/deepdoctection

A Python library for document AI tasks like layout analysis, table detection, and text extraction.

3.1K
Active
Python
Computer Vision
API Frameworks
PyTorch
#document-ai#document-analysis#ocr

luigifreda/pyslam

Python/C++ Visual SLAM pipeline for 3D reconstruction

3.1K
Active
Python
Machine Learning & AI Libraries
#pySLAM#Visual SLAM#3D Reconstruction

aoguai/LiYing

An automated photo processing program designed to automate the post-processing workflow of ID photos in photo studios.

3.1K
Stable
Python
File Storage
Image Processing
Python
#background-replacement#image-compression#image-cropping

sgrvinod/a-PyTorch-Tutorial-to-Object-Detection

A PyTorch tutorial for object detection using the Single Shot MultiBox Detector (SSD) algorithm.

3.1K
Archived
Python
Computer Vision
Tutorials & Courses
PyTorch
#object-detection#object-recognition#pytorch-tutorial

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.

3.1K
Archived
AI Voice & Speech
#speech-recognition#speech-synthesis#language-modeling

open-mmlab/mmskeleton

A Python library for human pose estimation, skeleton-based action recognition, and action synthesis.

3.1K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#computer-vision#pose-estimation#action-recognition

tusen-ai/simpledet

A Simple and Versatile Framework for Object Detection and Instance Recognition

3.1K
Archived
Python
React
#object-detection#instance-recognition#mxnet

vladmandic/human

A comprehensive AI-powered computer vision library for face detection, tracking, recognition, and more.

3.0K
Stable
HTML
Computer Vision
API Frameworks
TensorFlow.js
#face-detection#face-recognition#body-tracking

breezedeus/Pix2Text

An open-source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them to Markdown.

3.0K
Experimental
Jupyter Notebook
Computer Vision
File Storage
PyTorch
#ocr#latex#math-formula-recognition

theajack/cnchar

A comprehensive TypeScript library for working with Chinese characters, including features like pinyin, stroke, and voice recognition.

3.0K
Experimental
TypeScript
Frontend Frameworks
CLI Tools
TypeScript
#chinese-characters#pinyin#stroke-recognition

HeyWillow/willow

Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools

3.0K
Experimental
C
AI Voice & Speech
AI App Builders
ESP-IDF
#alexa#google-home#speech-recognition

Cartucho/mAP

A Python library for evaluating the performance of neural networks for object detection.

3.0K
Archived
Python
Computer Vision
Testing
#object-detection#neural-network#evaluation
1...68...22

Stay in the loop

Get weekly updates on trending AI coding tools and projects.