Showing 261-280 of 426 projects
A library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, for supervised speaker diarization.
Code and models for Temporal Segment Networks (TSN) for action recognition in video understanding.
A real-time object detection model that leverages the DINO transformer for efficient and accurate object recognition.
A Python-based OCR (Optical Character Recognition) library for Tencent Hunyuan, a vibe coder's AI-powered developer platform.
A Python library for training and deploying image-based captcha recognition models.
CLUENER2020 is a Chinese fine-grained named entity recognition dataset and benchmark for AI-powered NLP development.
A Python library for building a better Chinese character recognition OCR than Tesseract.
A library of pre-trained LSTM models for optical character recognition (OCR) tasks.
This repository provides implementations of different architectures for emotion recognition in conversations.
An expanded version of the Espressif ESP32-CAM webcam with support for AI-powered features like face recognition.
Burner X is a browser-based tool for AI literature recognition, document batch translation, and smart analysis.
A Python library for named-entity recognition, part-of-speech tagging, and other NLP tasks using LSTM-CRF and ELMo models.
A flexible Python library for optical character recognition (OCR) using the CRAFT text detector and Keras CRNN recognition model.
A collection of custom nodes that extend the capabilities of the ComfyUI AI coding tool.
A Chinese Named Entity Recognition (NER) library built with PyTorch and TensorFlow
Generate text images for training deep learning OCR models, a key tool for vibe coders working with AI-powered text recognition.
A library for speech synthesis and recognition using neural networks
PyTorch implementation of a multi-label image recognition model using graph convolutional networks.
Amica is an open-source interface for interactive communication with 3D characters using voice synthesis and recognition.
Lazyeat is a touch-free controller for use while eating, allowing developers to pause videos/full screen/switch videos just by gesturing to the camera.
Get weekly updates on trending AI coding tools and projects.