Showing 1-20 of 426 projects
Model framework for state-of-the-art ML models in text, vision, audio, and multimodal tasks.
Robust speech recognition model for multilingual tasks
Comprehensive Chinese NLP resource collection for developers
OCR engine for text recognition in images
Simplified facial recognition API for Python and command line
High-performance C/C++ port of OpenAI's Whisper for speech recognition
Offline OCR software with batch processing, PDF support, and multi-language recognition.
Document management system for scanning, indexing, and archiving documents
Multilingual NLP toolkit for production with PyTorch/TensorFlow
Detectron2 is a PyTorch-based library for object detection, segmentation, and visual recognition tasks.
Industrial-strength NLP library for Python with pretrained models and fast processing
OCR library with 80+ languages and scripts support
2D/3D face analysis with AI
Offline speech-to-text engine for real-time on-device use
Image annotation tool for computer vision projects
JavaScript library for multi-touch gesture recognition
Open-source voice AI models for speech synthesis and recognition
Tracks progress in NLP tasks with datasets and benchmarks
DeepFace is a lightweight Python library for face recognition and facial attribute analysis, including age, gender, emotion, and race detection.
Faster Whisper transcription with CTranslate2 for efficient speech-to-text
Get weekly updates on trending AI coding tools and projects.