Explore Projects

Discover 426 open source projects

Active filters (1):
Search: recognitionร—
Clear all

Showing 81-100 of 426 projects

madmaze/pytesseract

pytesseract is a Python wrapper for Google Tesseract, a popular optical character recognition (OCR) engine.

6.3K
Active
Python
Computer Vision
#ocr#image-processing#text-extraction

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K
Active
Python
AI Voice & Speech
CLI Tools
Apple MLX
#apple-silicon#speech-recognition#speech-synthesis

PaddlePaddle/PaddleX

PaddleX is an all-in-one development tool based on PaddlePaddle, providing AI pipelines for computer vision, NLP, and more.

6.1K
Active
Python
Computer Vision
Natural Language Processing
Python
#computer-vision#natural-language-processing#ocr

dmlc/gluon-cv

An open-source toolkit for computer vision tasks, supporting a wide range of deep learning models and applications.

5.9K
Archived
Python
Computer Vision
ML Ops
MXNet
#computer-vision#deep-learning#machine-learning

mindee/doctr

docTR is a high-performing and accessible library for OCR-related tasks powered by deep learning.

5.9K
Stable
Python
Computer Vision
API Frameworks
PyTorch
#ocr#text-detection#text-recognition

PaddlePaddle/PaddleClas

A comprehensive visual classification and recognition library powered by the PaddlePaddle deep learning framework.

5.8K
Stable
Python
Computer Vision
API Frameworks
Python
#image-classification#image-recognition#image-retrieval

argmaxinc/WhisperKit

An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.

5.7K
Active
Swift
AI Voice & Speech
iOS
#speech-recognition#transformers#inference

cgzirim/seek-tune

An open-source implementation of the Shazam audio fingerprinting algorithm for song recognition in Go.

5.6K
Stable
Go
API Frameworks
Audio Processing
#audio-fingerprinting#shazam#song-recognition

Shawn-Shan/fawkes

Fawkes is a privacy-preserving tool against facial recognition systems, built using Python.

5.5K
Archived
Python
Computer Vision
Privacy Tools
#adversarial-machine-learning#face-recognition#privacy-enhancing-technologies

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K
Stable
Jupyter Notebook
React
#asr#speaker-diarization#speech-recognition

modelscope/FunClip

Open-source video speech recognition & clipping tool with LLM-based AI clipping capabilities

5.4K
Experimental
Python
LLM Frameworks
AI Voice & Speech
gradio
#speech-recognition#video-subtitles#llm

amdegroot/ssd.pytorch

A PyTorch-based implementation of the Single Shot MultiBox Detector for object detection in computer vision tasks.

5.2K
Archived
Python
Computer Vision
Backend Frameworks
PyTorch
#computer-vision#object-detection#deep-learning

timesler/facenet-pytorch

Pretrained PyTorch models for face detection and facial recognition, useful for building computer vision applications.

5.1K
Stable
Python
Computer Vision
PyTorch
#face-detection#face-recognition#facial-recognition

ThoughtfulDev/EagleEye

A Python tool that uses image recognition and reverse image search to find people's social media profiles, designed for 'vibe coders'.

5.0K
Archived
Python
Computer Vision
General Utilities
Python
#face-recognition#reverse-image-search#social-media

open-mmlab/mmaction2

OpenMMLab's toolbox and benchmark for advanced video understanding and action recognition.

4.9K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#video-classification#action-recognition#benchmark

macanv/BERT-BiLSTM-CRF-NER

A Tensorflow solution for Named Entity Recognition (NER) using the BERT-BiLSTM-CRF model with BERT fine-tuning.

4.9K
Archived
Python
NER
API Frameworks
TensorFlow
#bert#named-entity-recognition#ner

ChanChiChoi/awesome-Face_Recognition

A curated collection of papers and resources related to various aspects of face recognition technology.

4.7K
Archived
Computer Vision
Tutorials & Courses
#face-detection#face-recognition#face-alignment

Picovoice/porcupine

Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.

4.7K
Active
Python
AI Voice & Speech
CLI Tools
Python
#speech-recognition#voice-activation#wake-word-detection

open-mmlab/mmocr

An open-source toolbox for text detection, recognition, and understanding tasks powered by PyTorch.

4.7K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#ocr#text-detection#text-recognition

sanchit-gandhi/whisper-jax

A JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU for speech-to-text tasks.

4.7K
Archived
Jupyter Notebook
LLM Frameworks
Speech-to-Text
JAX
#speech-recognition#whisper#jax
1...46...22

Stay in the loop

Get weekly updates on trending AI coding tools and projects.