Explore Projects

Discover 426 open source projects

Active filters (1):

Search: recognition×

Clear all

Showing 241-260 of 426 projects

wanghaisheng/awesome-ocr

A curated list of promising OCR (Optical Character Recognition) resources for developers.

1.7K

Archived

API Frameworks

Computer Vision

#ocr#optical-character-recognition#computer-vision

strob/gentle

A Python library for forced audio alignment, useful for speech recognition and audio processing tasks.

1.7K

Experimental

Python

API Frameworks

Caching

#audio-processing#speech-recognition#forced-alignment

MCG-NJU/VideoMAE

A self-supervised video representation learning model for video understanding tasks.

1.7K

Archived

Python

Computer Vision

API Frameworks

PyTorch

#video-analysis#video-understanding#self-supervised-learning

PaddlePaddle/PaddleVideo

PaddleVideo is a powerful toolkit for video understanding tasks like action recognition, localization, and detection.

1.7K

Experimental

Python

Computer Vision

API Frameworks

Python

#video-recognition#action-detection#action-localization

ckiplab/ckiptagger

A Python library for Chinese word segmentation, part-of-speech tagging, and named entity recognition.

1.7K

Experimental

Python

Natural Language Processing

Python

#natural-language-processing#word-segmentation#pos-tagging

undertheseanlp/underthesea

Underthesea is a powerful Vietnamese NLP toolkit for developers working with natural language processing tasks.

1.7K

Active

Python

LLM Frameworks

API Frameworks

#vietnamese#nlp#natural-language-processing

chongyangtao/Awesome-Scene-Text-Recognition

A curated list of resources dedicated to scene text localization and recognition.

1.7K

Archived

Text Detection

Text Recognition

#natural-images#scene-texts#text-detection

MarkPDFdown/markpdfdown

A high-quality PDF to Markdown conversion tool powered by large language model visual recognition.

1.7K

Active

Python

LLM Wrappers & SDKs

ETL & Pipelines

Python

#pdf-converter#markdown-generation#llm-integration

alan-ai/alan-sdk-ionic

A self-coding system for Ionic apps using AI-powered chatbot and voice assistant SDK.

1.7K

Experimental

TypeScript

React

#ionic#chatbot#conversational-ai

HoshinoSuzumi/chronoframe

Self-hosted personal gallery app with online photo management, EXIF parsing, geolocation, and WebGL viewer.

1.7K

Active

Vue

Component Libraries (Vue/Svelte)

Frontend Frameworks

Vue

#photo-gallery#exif-extraction#geocoding

szczyglis-dev/py-gpt

A Python-based desktop AI assistant that integrates with various LLMs and AI tools for coding, task automation, and more.

1.7K

Active

Python

LLM Frameworks

Agents & Orchestration

Python

#ai-assistant#llm#automation

chrismattmann/tika-python

A Python binding to the Apache Tika REST service, enabling text extraction and parsing in Python.

1.6K

Experimental

Python

API Clients & Testing

Data Processing

Python

#text-extraction#text-processing#data-extraction

neural-maze/ava-whatsapp-agent-course

A Python-based agent that uses speech recognition and text-to-speech to enable conversational interactions via WhatsApp.

1.6K

Stable

Python

Agents & Orchestration

AI Voice & Speech

#agent#stt#tts

pemistahl/lingua-py

A highly accurate natural language detection library for Python, suitable for short and mixed-language text.

1.6K

Stable

Python

API Clients & Testing

Data Processing

Python

#language-detection#language-classification#natural-language-processing

absadiki/subsai

A Python-based tool for generating subtitles using OpenAI's Whisper speech recognition model.

1.6K

Stable

Python

LLM Wrappers & SDKs

API Frameworks

Python

#subtitles#speech-recognition#whisper

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection for offline use on multiple platforms.

1.6K

Stable

C++

AI Voice & Speech

Cross-Platform

#speech-recognition#voice-activity-detection#offline

msgi/nlp-journey

A collection of Python code and resources related to natural language processing tasks like topic modeling, text classification, and machine translation.

1.6K

Active

Python

LLM Frameworks

Databases

Python

#natural-language-processing#deep-learning#topic-modeling

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K

Active

Swift

AI Voice & Speech

iOS

Swift

#text-to-speech#speech-to-text#voice-activity-detection

myhhub/KnowledgeGraph

This Python project helps developers build knowledge graphs from scratch, including named entity recognition, relation extraction, and question answering.

1.6K

Archived

Python

Knowledge Representation

Databases

#knowledge-graph#named-entity-recognition#relation-extraction