Showing 81-100 of 426 projects
pytesseract is a Python wrapper for Google Tesseract, a popular optical character recognition (OCR) engine.
A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.
PaddleX is an all-in-one development tool based on PaddlePaddle, providing AI pipelines for computer vision, NLP, and more.
An open-source toolkit for computer vision tasks, supporting a wide range of deep learning models and applications.
docTR is a high-performing and accessible library for OCR-related tasks powered by deep learning.
A comprehensive visual classification and recognition library powered by the PaddlePaddle deep learning framework.
An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.
An open-source implementation of the Shazam audio fingerprinting algorithm for song recognition in Go.
Fawkes is a privacy-preserving tool against facial recognition systems, built using Python.
Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper
Open-source video speech recognition & clipping tool with LLM-based AI clipping capabilities
A PyTorch-based implementation of the Single Shot MultiBox Detector for object detection in computer vision tasks.
Pretrained PyTorch models for face detection and facial recognition, useful for building computer vision applications.
A Python tool that uses image recognition and reverse image search to find people's social media profiles, designed for 'vibe coders'.
OpenMMLab's toolbox and benchmark for advanced video understanding and action recognition.
A Tensorflow solution for Named Entity Recognition (NER) using the BERT-BiLSTM-CRF model with BERT fine-tuning.
A curated collection of papers and resources related to various aspects of face recognition technology.
Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.
An open-source toolbox for text detection, recognition, and understanding tasks powered by PyTorch.
A JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU for speech-to-text tasks.
Get weekly updates on trending AI coding tools and projects.