Explore Projects

Discover 426 open source projects

Active filters (1):

Search: recognition

Showing 81-100 of 426 projects

madmaze/pytesseract

pytesseract is a Python wrapper for Google Tesseract, a popular optical character recognition (OCR) engine.

6.3K

Active

Python

Computer Vision

#ocr #image-processing #text-extraction

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K

Active

Python

AI Voice & Speech

CLI Tools

Apple MLX

#apple-silicon #speech-recognition #speech-synthesis

PaddlePaddle/PaddleX

PaddleX is an all-in-one development tool based on PaddlePaddle, providing AI pipelines for computer vision, NLP, and more.

6.1K

Active

Python

Computer Vision

Natural Language Processing

Python

#computer-vision #natural-language-processing #ocr

dmlc/gluon-cv

An open-source toolkit for computer vision tasks, supporting a wide range of deep learning models and applications.

5.9K

Archived

Python

Computer Vision

ML Ops

MXNet

#computer-vision #deep-learning #machine-learning

mindee/doctr

docTR is a high-performing and accessible library for OCR-related tasks powered by deep learning.

5.9K

Stable

Python

Computer Vision

API Frameworks

PyTorch

#ocr #text-detection #text-recognition

PaddlePaddle/PaddleClas

A comprehensive visual classification and recognition library powered by the PaddlePaddle deep learning framework.

5.8K

Stable

Python

Computer Vision

API Frameworks

Python

#image-classification #image-recognition #image-retrieval

argmaxinc/WhisperKit

An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.

5.7K

Active

Swift

AI Voice & Speech

iOS

#speech-recognition #transformers #inference

cgzirim/seek-tune

An open-source implementation of the Shazam audio fingerprinting algorithm for song recognition in Go.

5.6K

Stable

API Frameworks

Audio Processing

#audio-fingerprinting #shazam #song-recognition

Shawn-Shan/fawkes

Fawkes is a privacy-preserving tool against facial recognition systems, built using Python.

5.5K

Archived

Python

Computer Vision

Privacy Tools

#adversarial-machine-learning #face-recognition #privacy-enhancing-technologies

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K

Stable

Jupyter Notebook

React

#asr #speaker-diarization #speech-recognition

modelscope/FunClip

Open-source video speech recognition & clipping tool with LLM-based AI clipping capabilities

5.4K

Experimental

Python

LLM Frameworks

AI Voice & Speech

gradio

#speech-recognition #video-subtitles #llm

amdegroot/ssd.pytorch

A PyTorch-based implementation of the Single Shot MultiBox Detector for object detection in computer vision tasks.

5.2K

Archived

Python

Computer Vision

Backend Frameworks

PyTorch

#computer-vision #object-detection #deep-learning

timesler/facenet-pytorch

Pretrained PyTorch models for face detection and facial recognition, useful for building computer vision applications.

5.1K

Stable

Python

Computer Vision

PyTorch

#face-detection #face-recognition #facial-recognition

ThoughtfulDev/EagleEye

A Python tool that uses image recognition and reverse image search to find people's social media profiles.

5.0K

Archived

Python

Computer Vision

General Utilities

Python

#face-recognition #reverse-image-search #social-media

open-mmlab/mmaction2

OpenMMLab's toolbox and benchmark for advanced video understanding and action recognition.

4.9K

Archived

Python

Computer Vision

API Frameworks

PyTorch

#video-classification #action-recognition #benchmark

macanv/BERT-BiLSTM-CRF-NER

A TensorFlow solution for Named Entity Recognition (NER) using the BERT-BiLSTM-CRF model with BERT fine-tuning.

4.9K

Archived

Python

NER

API Frameworks

TensorFlow

#bert #named-entity-recognition #ner

ChanChiChoi/awesome-Face_Recognition

A curated collection of papers and resources related to various aspects of face recognition technology.

4.7K

Archived

Computer Vision

Tutorials & Courses

#face-detection #face-recognition #face-alignment

Picovoice/porcupine

Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.

4.7K

Active

Python

AI Voice & Speech

CLI Tools

Python

#speech-recognition #voice-activation #wake-word-detection

open-mmlab/mmocr

An open-source toolbox for text detection, recognition, and understanding tasks powered by PyTorch.

4.7K

Archived

Python

Computer Vision

API Frameworks

PyTorch

#ocr #text-detection #text-recognition

sanchit-gandhi/whisper-jax

A JAX implementation of OpenAI's Whisper model, offering up to a 70x speed-up on TPU for speech-to-text tasks.

4.7K

Archived

Jupyter Notebook

LLM Frameworks

Speech-to-Text

JAX

#speech-recognition #whisper #jax
