Explore Projects

Discover 426 open source projects

Active filters (1):
Search: recognitionร—
Clear all

Showing 141-160 of 426 projects

xiaofengShi/CHINESE-OCR

A Python-based OCR library that uses CTPN, CRNN, and CTC to perform text detection and recognition in natural scenes.

3.0K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#ocr#text-detection#text-recognition

chenyme/Chenyme-AAVT

This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.

2.9K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#video-translation#whisper

pluja/whishper

A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.

2.9K
Stable
Svelte
AI Voice & Speech
Frontend Frameworks
Svelte
#speech-recognition#speech-to-text#transcription

biometrics/openbr

Open-source biometrics and face recognition library written in C++.

2.9K
Active
C++
Computer Vision
#biometrics#face-recognition#computer-vision

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.

2.9K
Stable
Desktop Model Runners
AI Voice & Speech
Whisper
#speech-to-text#whisper#faster-whisper

urchade/GLiNER

A lightweight and generalist NER model for extracting entities from text, with support for prompt-tuning.

2.9K
Active
Python
LLM Frameworks
Named Entity Recognition
Python
#information-extraction#named-entity-recognition#natural-language-processing

microsoft/table-transformer

Deep learning model for extracting & analyzing table structures from PDFs and images with datasets.

2.9K
Archived
Python
Computer Vision
ETL & Pipelines
PyTorch
#table-extraction#computer-vision#document-processing

modelscope/3D-Speaker

A library for single- and multi-modal speaker verification, recognition, and diarization.

2.8K
Stable
Python
Computer Vision
AI Voice & Speech
Python
#speaker-verification#speaker-recognition#speaker-diarization

YCG09/chinese_ocr

A Chinese OCR (Optical Character Recognition) library built using CTPN, DenseNet, and CTC.

2.8K
Archived
Python
Computer Vision
Tensorflow
#ocr#computer-vision#text-recognition

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K
Stable
Python
AI Voice & Speech
CLI Tools
PyTorch
#speech-recognition#multilingual#transformers

hooram/ownphotos

A self-hosted alternative to Google Photos with features like face detection and object recognition.

2.8K
Archived
Jupyter Notebook
API Frameworks
Authentication
Django
#photos#gallery#self-hosted

rhasspy/rhasspy

Offline private voice assistant for many human languages, built with privacy and security in mind.

2.7K
Experimental
Shell
API Frameworks
AI Voice & Speech
Node
#voice-assistant#speech-recognition#privacy

blmoistawinde/HarvestText

A versatile NLP toolkit for text mining and preprocessing, supporting tasks like sentiment analysis, entity extraction, and keyword summarization.

2.6K
Archived
Python
NLP
CLI Tools
Python
#nlp#text-mining#sentiment-analysis

zzmp/juliusjs

A speech recognition library for the web, allowing developers to build AI-powered applications.

2.6K
Archived
JavaScript
Prompt Engineering
React
#speech recognition#web development#AI-powered

coqui-ai/STT

An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.

2.6K
Archived
C++
Speech Recognition
API Frameworks
TensorFlow
#speech-recognition#deep-learning#asr

kha-white/manga-ocr

Optical character recognition for Japanese manga comics, built with Python and deep learning.

2.6K
Experimental
Python
Computer Vision
Backend Frameworks
#comics#ocr#japanese

gerdm/prml

A collection of Jupyter notebooks with Python code and notes for the book Pattern Recognition and Machine Learning.

2.6K
Archived
Jupyter Notebook
Machine Learning
Tutorials & Courses
#machine-learning#pattern-recognition#bayesian-statistics

detectRecog/CCPD

A diverse and well-annotated dataset for license plate detection and recognition

2.5K
Archived
Python
Computer Vision
Datasets
#ccpd#dataset#detection

X-PLUG/mPLUG-Owl

A powerful multi-modal large language model family for building advanced AI chatbots and visual recognition models.

2.5K
Experimental
Python
LLM Frameworks
Computer Vision
PyTorch
#chatbot#gpt#multimodal

hwalsuklee/awesome-deep-text-detection-recognition

A curated list of resources for text detection/recognition with deep learning methods

2.5K
Archived
React
#OCR#Deep Learning#Text Detection
1...79...22

Stay in the loop

Get weekly updates on trending AI coding tools and projects.