Explore Projects

Discover 426 open source projects

Active filters (1):
Search: recognition×
Clear all

Showing 21-40 of 426 projects

Zeyi-Lin/HivisionIDPhotos

AI-powered ID photo generation with face detection and matting

20.8K
Experimental
Python
Computer Vision
General Utilities
FastAPI
#id-photo-generator#face-recognition#image-matting

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K
Active
Python
AI Voice & Speech
Python
#asr#speech-to-text#diarization

MaaAssistantArknights/MaaAssistantArknights

Arknights game automation with image recognition

19.8K
Active
C++
Computer Vision
Agent Coordination
C++
#arknights#game-automation#computer-vision

datalab-to/surya

Document OCR toolkit for 90+ languages with layout analysis, reading order detection, and table recognition

19.4K
Active
Python
Computer Vision
Python
#ocr#document-analysis#layout-detection

antlr/antlr4

ANTLR is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.

18.8K
Active
Java
CLI Tools
#parser-generator#language-recognition#parsing

justadudewhohacks/face-api.js

A TypeScript library for face detection and face recognition in the browser and Node.js using TensorFlow.js.

17.8K
Archived
TypeScript
Computer Vision
React
#face-detection#face-recognition#computer-vision

pot-app/pot-desktop

A cross-platform software for text translation and recognition, focused on vibe coders.

17.2K
Active
JavaScript
Component Libraries (React)
React
#ocr#translate#translation

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K
Active
TypeScript
AI Voice & Speech
Node
#open-source#virtual-assistant#speech-to-text

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers

16.9K
Active
Python
React
#generative-ai#machine-learning#neural-networks

cmusatyalab/openface

Face recognition with deep neural networks using Lua.

15.4K
Archived
Lua
React
#face-recognition#deep-learning#facenet

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K
Stable
Shell
Speech Recognition
#speech-recognition#speaker-identification#speaker-verification

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

NVIDIA/DeepLearningExamples

A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.

14.7K
Archived
Jupyter Notebook
ML Ops
PyTorch
#deep-learning#computer-vision#natural-language-processing

flairNLP/flair

A simple, state-of-the-art NLP framework for tasks like named entity recognition and semantic role labeling.

14.4K
Stable
Python
NLP Frameworks
PyTorch
#natural-language-processing#machine-learning#sequence-labeling

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K
Stable
Jupyter Notebook
AI Voice & Speech
Node
#speech-recognition#voice-recognition#offline

davidsandberg/facenet

Face recognition using the FaceNet deep learning model, built with TensorFlow.

14.3K
Archived
Python
Computer Vision
TensorFlow
#face-detection#face-recognition#deep-learning

PaddlePaddle/PaddleDetection

PaddleDetection is an open-source object detection toolkit based on the PaddlePaddle deep learning framework, supporting various computer vision tasks.

14.1K
Stable
Python
Computer Vision
Python
#object-detection#instance-segmentation#multi-object-tracking

kmario23/deep-learning-drizzle

A comprehensive collection of deep learning, reinforcement learning, and machine learning resources for vibe coders.

12.8K
Archived
HTML
Machine Learning
#deep-learning#machine-learning#computer-vision

Anionex/banana-slides

An AI-powered PPT generator that allows users to create visually stunning presentations with just a few clicks.

12.7K
Active
Python
LLM Frameworks
React
#ai-ppt-maker#text2image#editable-pptx

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K
Active
Python
AI Voice & Speech
Python
#speech-recognition#speech-synthesis#speech-translation
13...22

Stay in the loop

Get weekly updates on trending AI coding tools and projects.