Explore Projects

Discover 426 open source projects

Active filters (1):
Search: recognitionร—
Clear all

Showing 241-260 of 426 projects

wanghaisheng/awesome-ocr

A curated list of promising OCR (Optical Character Recognition) resources for developers.

1.7K
Archived
API Frameworks
Computer Vision
#ocr#optical-character-recognition#computer-vision

strob/gentle

A Python library for forced audio alignment, useful for speech recognition and audio processing tasks.

1.7K
Experimental
Python
API Frameworks
Caching
#audio-processing#speech-recognition#forced-alignment

MCG-NJU/VideoMAE

A self-supervised video representation learning model for video understanding tasks.

1.7K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#video-analysis#video-understanding#self-supervised-learning

PaddlePaddle/PaddleVideo

PaddleVideo is a powerful toolkit for video understanding tasks like action recognition, localization, and detection.

1.7K
Experimental
Python
Computer Vision
API Frameworks
Python
#video-recognition#action-detection#action-localization

ckiplab/ckiptagger

A Python library for Chinese word segmentation, part-of-speech tagging, and named entity recognition.

1.7K
Experimental
Python
Natural Language Processing
Python
#natural-language-processing#word-segmentation#pos-tagging

undertheseanlp/underthesea

Underthesea is a powerful Vietnamese NLP toolkit for developers working with natural language processing tasks.

1.7K
Active
Python
LLM Frameworks
API Frameworks
#vietnamese#nlp#natural-language-processing

chongyangtao/Awesome-Scene-Text-Recognition

A curated list of resources dedicated to scene text localization and recognition.

1.7K
Archived
Text Detection
Text Recognition
#natural-images#scene-texts#text-detection

MarkPDFdown/markpdfdown

A high-quality PDF to Markdown conversion tool powered by large language model visual recognition.

1.7K
Active
Python
LLM Wrappers & SDKs
ETL & Pipelines
Python
#pdf-converter#markdown-generation#llm-integration

alan-ai/alan-sdk-ionic

A self-coding system for Ionic apps using AI-powered chatbot and voice assistant SDK.

1.7K
Experimental
TypeScript
React
#ionic#chatbot#conversational-ai

HoshinoSuzumi/chronoframe

Self-hosted personal gallery app with online photo management, EXIF parsing, geolocation, and WebGL viewer.

1.7K
Active
Vue
Component Libraries (Vue/Svelte)
Frontend Frameworks
Vue
#photo-gallery#exif-extraction#geocoding

szczyglis-dev/py-gpt

A Python-based desktop AI assistant that integrates with various LLMs and AI tools for coding, task automation, and more.

1.7K
Active
Python
LLM Frameworks
Agents & Orchestration
Python
#ai-assistant#llm#automation

chrismattmann/tika-python

A Python binding to the Apache Tika REST service, enabling text extraction and parsing in Python.

1.6K
Experimental
Python
API Clients & Testing
Data Processing
Python
#text-extraction#text-processing#data-extraction

neural-maze/ava-whatsapp-agent-course

A Python-based agent that uses speech recognition and text-to-speech to enable conversational interactions via WhatsApp.

1.6K
Stable
Python
Agents & Orchestration
AI Voice & Speech
#agent#stt#tts

pemistahl/lingua-py

A highly accurate natural language detection library for Python, suitable for short and mixed-language text.

1.6K
Stable
Python
API Clients & Testing
Data Processing
Python
#language-detection#language-classification#natural-language-processing

absadiki/subsai

A Python-based tool for generating subtitles using OpenAI's Whisper speech recognition model.

1.6K
Stable
Python
LLM Wrappers & SDKs
API Frameworks
Python
#subtitles#speech-recognition#whisper

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection for offline use on multiple platforms.

1.6K
Stable
C++
AI Voice & Speech
Cross-Platform
#speech-recognition#voice-activity-detection#offline

msgi/nlp-journey

A collection of Python code and resources related to natural language processing tasks like topic modeling, text classification, and machine translation.

1.6K
Active
Python
LLM Frameworks
Databases
Python
#natural-language-processing#deep-learning#topic-modeling

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K
Active
Swift
AI Voice & Speech
iOS
Swift
#text-to-speech#speech-to-text#voice-activity-detection

myhhub/KnowledgeGraph

This Python project helps developers build knowledge graphs from scratch, including named entity recognition, relation extraction, and question answering.

1.6K
Archived
Python
Knowledge Representation
Databases
#knowledge-graph#named-entity-recognition#relation-extraction

iMoonLab/yolov13

Implementation of the state-of-the-art YOLOv13 object detection model with hypergraph-enhanced visual perception.

1.6K
Stable
Python
Computer Vision
API Frameworks
Python
#object-detection#real-time#hypergraph-learning
1...1214...22

Stay in the loop

Get weekly updates on trending AI coding tools and projects.