Explore Projects

Discover 172 open source projects

Active filters (1):
Search: ocrร—
Clear all

Showing 21-40 of 172 projects

getomni-ai/zerox

OCR and document extraction using vision models

12.2K
Experimental
TypeScript
AI Editors/Agents/Copilot
TensorFlow
#Machine Learning#Computer Vision#Natural Language Processing

T8RIN/ImageToolbox

A powerful app for advanced image manipulation with dozens of features

12.0K
Active
Kotlin
React
#image-manipulation#photo-editor#android-app

alam00000/bentopdf

A privacy-focused PDF toolkit for developers, providing a range of PDF-related tools and utilities.

11.8K
Active
JavaScript
Component Libraries (React)
React
#pdf#pdf-viewer#pdf-editor

dataelement/bisheng

An open LLM devops platform for building next-gen enterprise AI applications with powerful features like GenAI workflow, RAG, Agent, and model management.

11.1K
Active
TypeScript
LLM Frameworks
React
#ai#llm#genai

vipstone/faceai

An entry-level project for face, video, and text detection and recognition using Python and popular AI/ML libraries.

11.1K
Archived
Python
Computer Vision
#face-detection#video-processing#text-recognition

HIllya51/LunaTranslator

A C++ library for translating visual novels and galgames through OCR and reverse-engineering techniques.

10.8K
Active
C++
Animation & Motion
#visual-novel#ocr#reverse-engineering

yusufkaraaslan/Skill_Seekers

Automatically convert documentation, GitHub repos, and PDFs into Claude AI skills with conflict detection.

10.2K
Active
Python
AI Code Generation
MCP Servers
Python
#ai-tools#automation#claude-ai

ripperhe/Bob

Bob is a macOS app that provides translation and OCR capabilities for developers who work with AI tools.

9.6K
Stable
LLM Frameworks
macOS
#chatgpt#ocr#translate

zyddnys/manga-image-translator

An open-source project that uses deep learning and OCR to translate text in manga/images

9.5K
Stable
Python
Computer Vision
PyTorch
#anime#image-processing#machine-translation

pymupdf/PyMuPDF

A high-performance Python library for data extraction, analysis, conversion and manipulation of PDF and other documents.

9.2K
Active
Python
Document Processing
#pdf#data-extraction#text-processing

bytedance/Dolphin

Dolphin is a document image parsing library that uses heterogeneous anchor prompting for OCR and layout analysis.

8.9K
Stable
Python
Computer Vision
API Frameworks
Python
#document-analysis#layout-analysis#ocr

YaoFANGUK/video-subtitle-extractor

A Python tool for extracting hard-coded subtitles from videos and generating SRT files using deep learning-based OCR.

8.5K
Stable
Python
Computer Vision
API Frameworks
#ocr#subtitles#srt

CVHub520/X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other powerful models.

8.3K
Active
Python
Computer Vision
ML Ops
Python
#artificial-intelligence#computer-vision#image-annotation

Ucas-HaoranWei/GOT-OCR2.0

Open-source implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model.

8.1K
Experimental
Python
Computer Vision
#ocr#computer-vision#machine-learning

the-paperless-project/paperless

A Python-based document management system for scanning, indexing, and archiving paper documents.

7.9K
Archived
Python
API Frameworks
Search
#archiving#documents#ocr

microsoft/ailab

Provides AI-powered tools and samples for developers to build cutting-edge applications with Microsoft AI.

7.9K
Archived
C#
AI SDKs & Wrappers
API Frameworks
React
#ai#computer-vision#object-detection

tesseract-ocr/tessdata

A collection of trained models for the Tesseract OCR engine, a powerful open-source optical character recognition tool.

7.4K
Archived
Computer Vision
#ocr#computer-vision#tesseract

adithya-s-k/omniparse

A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.

6.8K
Stable
Python
LLM Frameworks
File Storage
Python
#ingestion-api#ocr#parser-library

clovaai/donut

Donut is an OCR-free Document Understanding Transformer and Synthetic Document Generator for computer vision and document AI tasks.

6.8K
Archived
Python
React
#document-ai#computer-vision#open-source

madmaze/pytesseract

pytesseract is a Python wrapper for Google Tesseract, a popular optical character recognition (OCR) engine.

6.3K
Active
Python
Computer Vision
#ocr#image-processing#text-extraction
13...9

Stay in the loop

Get weekly updates on trending AI coding tools and projects.