Explore Projects

Discover 172 open source projects

Active filters (1):

Search: ocr

Showing 21-40 of 172 projects

getomni-ai/zerox

OCR and document extraction using vision models

12.2K

Experimental

TypeScript

AI Editors/Agents/Copilot

TensorFlow

#Machine Learning#Computer Vision#Natural Language Processing

T8RIN/ImageToolbox

A powerful app for advanced image manipulation with dozens of features

12.0K

Active

Kotlin

React

#image-manipulation#photo-editor#android-app

alam00000/bentopdf

A privacy-focused PDF toolkit for developers, providing a range of PDF-related tools and utilities.

11.8K

Active

JavaScript

Component Libraries (React)

React

#pdf#pdf-viewer#pdf-editor

dataelement/bisheng

An open LLM DevOps platform for building next-generation enterprise AI applications, with features such as GenAI workflows, RAG, agents, and model management.

11.1K

Active

TypeScript

LLM Frameworks

React

#ai#llm#genai

vipstone/faceai

An entry-level project for face, video, and text detection and recognition using Python and popular AI/ML libraries.

11.1K

Archived

Python

Computer Vision

#face-detection#video-processing#text-recognition

HIllya51/LunaTranslator

A translation tool for visual novels and galgames, built on OCR and reverse-engineering techniques.

10.8K

Active

C++

Animation & Motion

#visual-novel#ocr#reverse-engineering

yusufkaraaslan/Skill_Seekers

Automatically convert documentation, GitHub repos, and PDFs into Claude AI skills with conflict detection.

10.2K

Active

Python

AI Code Generation

MCP Servers

Python

#ai-tools#automation#claude-ai

ripperhe/Bob

Bob is a macOS app that provides translation and OCR capabilities for developers who work with AI tools.

9.6K

Stable

LLM Frameworks

macOS

#chatgpt#ocr#translate

zyddnys/manga-image-translator

An open-source project that uses deep learning and OCR to translate text in manga and other images.

9.5K

Stable

Python

Computer Vision

PyTorch

#anime#image-processing#machine-translation

pymupdf/PyMuPDF

A high-performance Python library for data extraction, analysis, conversion and manipulation of PDF and other documents.

9.2K

Active

Python

Document Processing

#pdf#data-extraction#text-processing

bytedance/Dolphin

Dolphin is a document image parsing library that uses heterogeneous anchor prompting for OCR and layout analysis.

8.9K

Stable

Python

Computer Vision

API Frameworks

Python

#document-analysis#layout-analysis#ocr

YaoFANGUK/video-subtitle-extractor

A Python tool for extracting hard-coded subtitles from videos and generating SRT files using deep learning-based OCR.

8.5K

Stable

Python

Computer Vision

API Frameworks

#ocr#subtitles#srt

CVHub520/X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other powerful models.

8.3K

Active

Python

Computer Vision

ML Ops

Python

#artificial-intelligence#computer-vision#image-annotation

Ucas-HaoranWei/GOT-OCR2.0

Open-source implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model.

8.1K

Experimental

Python

Computer Vision

#ocr#computer-vision#machine-learning

the-paperless-project/paperless

A Python-based document management system for scanning, indexing, and archiving paper documents.

7.9K

Archived

Python

API Frameworks

#archiving#documents#ocr

microsoft/ailab

Provides AI-powered tools and samples for developers to build cutting-edge applications with Microsoft AI.

7.9K

Archived

AI SDKs & Wrappers

API Frameworks

React

#ai#computer-vision#object-detection

tesseract-ocr/tessdata

A collection of trained models for the Tesseract OCR engine, a powerful open-source optical character recognition tool.

7.4K

Archived

Computer Vision

#ocr#computer-vision#tesseract

adithya-s-k/omniparse

A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.

6.8K

Stable

Python

LLM Frameworks

File Storage

Python

#ingestion-api#ocr#parser-library

clovaai/donut

Donut is an OCR-free Document Understanding Transformer and Synthetic Document Generator for computer vision and document AI tasks.

6.8K

Archived

Python

React

#document-ai#computer-vision#open-source

madmaze/pytesseract

pytesseract is a Python wrapper for Google Tesseract, a popular optical character recognition (OCR) engine.

6.3K

Active

Python

Computer Vision

#ocr#image-processing#text-extraction
