Explore Projects

Discover 426 open source projects

Active filters (1):
Search: recognition×
Clear all

Showing 261-280 of 426 projects

google/uis-rnn

A library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, for supervised speaker diarization.

1.6K
Archived
Python
LLM Frameworks
API Frameworks
Python
#clustering#machine-learning#speaker-diarization

yjxiong/temporal-segment-networks

Code and models for Temporal Segment Networks (TSN) for action recognition in video understanding.

1.6K
Archived
Python
Computer Vision
#action-recognition#temporal-segment-networks#video-understanding

Intellindust-AI-Lab/DEIMv2

A real-time object detection model that leverages the DINO transformer for efficient and accurate object recognition.

1.6K
Active
Jupyter Notebook
Computer Vision
API Frameworks
Jupyter Notebook
#object-detection#real-time#dino-transformer

Tencent-Hunyuan/HunyuanOCR

A Python-based OCR (Optical Character Recognition) library for Tencent Hunyuan, a vibe coder's AI-powered developer platform.

1.6K
Active
Python
Computer Vision
MCP Frameworks
Python
#ocr#computer-vision#ai-tools

MgArcher/Text_select_captcha

A Python library for training and deploying image-based captcha recognition models.

1.6K
Archived
Python
Computer Vision
PyTorch
#captcha#image-recognition#machine-learning

CLUEbenchmark/CLUENER2020

CLUENER2020 is a Chinese fine-grained named entity recognition dataset and benchmark for AI-powered NLP development.

1.5K
Archived
Python
Fine-tuning
Databases
Python
#chinese-ner#named-entity-recognition#seq2seq

JinpengLI/deep_ocr

A Python library for building a better Chinese character recognition OCR than Tesseract.

1.5K
Archived
Python
Computer Vision
API Frameworks
Python
#ocr#chinese-character-recognition#computer-vision

tesseract-ocr/tessdata_best

A library of pre-trained LSTM models for optical character recognition (OCR) tasks.

1.5K
Archived
Computer Vision
#ocr#computer-vision#pre-trained-models

declare-lab/conv-emotion

This repository provides implementations of different architectures for emotion recognition in conversations.

1.5K
Archived
Python
Agents & Orchestration
Emotion Analysis
PyTorch
#conversational-ai#dialogue-systems#emotion-recognition

easytarget/esp32-cam-webserver

An expanded version of the Espressif ESP32-CAM webcam with support for AI-powered features like face recognition.

1.5K
Archived
C
Computer Vision
Arduino & Embedded
#esp32-cam#face-recognition#webcam

Feather-2/Burner-X

Burner X is a browser-based tool for AI literature recognition, document batch translation, and smart analysis.

1.5K
Active
JavaScript
React
#authentication#streaming#real-time

Hironsan/anago

A Python library for named-entity recognition, part-of-speech tagging, and other NLP tasks using LSTM-CRF and ELMo models.

1.5K
Archived
Python
NLP
API Frameworks
Keras
#named-entity-recognition#part-of-speech-tagging#sequence-labeling

faustomorales/keras-ocr

A flexible Python library for optical character recognition (OCR) using the CRAFT text detector and Keras CRNN recognition model.

1.5K
Stable
Python
Computer Vision
API Frameworks
Keras
#ocr#text-detection#keras-crnn

AlekPet/ComfyUI_Custom_Nodes_AlekPet

A collection of custom nodes that extend the capabilities of the ComfyUI AI coding tool.

1.5K
Active
JavaScript
AI Code Editors
LLM Frameworks
React
#comfyui#stable-diffusion#pose-detection

buppt/ChineseNER

A Chinese Named Entity Recognition (NER) library built with PyTorch and TensorFlow

1.5K
Archived
Python
NER & Text Processing
API Frameworks
PyTorch
#named-entity-recognition#chinese-nlp#bilstm-crf

Sanster/text_renderer

Generate text images for training deep learning OCR models, a key tool for vibe coders working with AI-powered text recognition.

1.5K
Archived
Python
Computer Vision
CLI Tools
Python
#ocr#computer-vision#text-generation

microsoft/NeuralSpeech

A library for speech synthesis and recognition using neural networks

1.5K
Archived
Python
Prompt Engineering
None
React
#speech-synthesis#neural-networks#prompt-engineering

megvii-research/ML-GCN

PyTorch implementation of a multi-label image recognition model using graph convolutional networks.

1.4K
Archived
Python
Computer Vision
PyTorch
#computer-vision#image-recognition#graph-convolutional-networks

semperai/amica

Amica is an open-source interface for interactive communication with 3D characters using voice synthesis and recognition.

1.4K
Experimental
TypeScript
AI Voice & Speech
Computer Vision
TypeScript
#ai-assistant#speech-recognition#text-to-speech

lanxiuyun/lazyeat

Lazyeat is a touch-free controller for use while eating, allowing developers to pause videos/full screen/switch videos just by gesturing to the camera.

1.4K
Active
Vue
Vue
#gesture-detection#hands-free#webcam-hacks
1...1315...22

Stay in the loop

Get weekly updates on trending AI coding tools and projects.