Explore Projects

Discover 172 open source projects

Active filters (1):
Search: ocr×
Clear all

Showing 1-20 of 172 projects

fighting41love/funNLP

Comprehensive Chinese NLP resource collection for developers

79.2K
Archived
Python
LLM Frameworks
RAG & Vector
Python
#nlp#chinese-nlp#ai-resources

Stirling-Tools/Stirling-PDF

Open-source PDF platform for editing, converting, and automating PDFs with desktop, browser, and self-hosted options.

75.0K
Active
TypeScript
CLI Tools
General Utilities
TypeScript
#pdf-editor#pdf-converter#pdf-tools

tesseract-ocr/tesseract

OCR engine for text recognition in images

72.7K
Active
C++
Computer Vision
CLI Tools
#ocr#computer-vision#command-line

PaddlePaddle/PaddleOCR

PaddleOCR converts documents/images to structured data for AI apps

71.6K
Active
Python
Computer Vision
MCP Servers
PaddlePaddle
#ocr#document-parsing#ai4science

opendatalab/MinerU

Converts complex documents into LLM-ready formats for agentic workflows

55.5K
Active
Python
Agents & Orchestration
Agent Coordination
Python
#document-analysis#pdf-extraction#llm-workflows

hiroi-sora/Umi-OCR

Offline OCR software with batch processing, PDF support, and multi-language recognition.

42.4K
Stable
Python
CLI Tools
Computer Vision
Python
#ocr#python#paddleocr

siyuan-note/siyuan

Privacy-first, self-hosted knowledge management with markdown and AI integrations

41.7K
Active
TypeScript
Full-Stack Frameworks
RAG & Vector
Electron
#knowledge-base#markdown#local-first

naptha/tesseract.js

JavaScript OCR library for image text extraction

37.9K
Active
JavaScript
Computer Vision
General Utilities
Node.js
#ocr#javascript#tesseract

paperless-ngx/paperless-ngx

Document management system for scanning, indexing, and archiving documents

37.1K
Active
Python
Collaboration & Real-time
Documentation
Django
#document-management#ocr#machine-learning

ShareX/ShareX

Screen capture and file sharing tool for developers

35.8K
Active
C#
CLI Tools
#screen-capture#file-sharing#csharp

ocrmypdf/OCRmyPDF

Adds OCR text layer to scanned PDFs for searchability

32.8K
Active
Python
CLI Tools
Computer Vision
#ocr#pdf-processing#command-line

JaidedAI/EasyOCR

OCR library with 80+ languages and scripts support

29.0K
Stable
Python
Computer Vision
#ocr#computer-vision#image-processing

deepseek-ai/DeepSeek-OCR

DeepSeek-OCR for visual-text compression and OCR tasks

22.6K
Active
Python
Computer Vision
Inference
vLLM
#ocr#computer-vision#inference

datalab-to/surya

Document OCR toolkit for 90+ languages with layout analysis, reading order detection, and table recognition

19.4K
Active
Python
Computer Vision
Python
#ocr#document-analysis#layout-detection

pot-app/pot-desktop

A cross-platform software for text translation and recognition, focused on vibe coders.

17.2K
Active
JavaScript
Component Libraries (React)
React
#ocr#translate#translation

lukas-blecher/LaTeX-OCR

A deep learning model that converts images of mathematical equations into LaTeX code.

16.2K
Archived
Python
Computer Vision
PyTorch
#ocr#latex#math

Unstructured-IO/unstructured

Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.

14.1K
Active
HTML
Document Processing
#document-processing#data-pipelines#natural-language-processing

sml2h3/ddddocr

A Python library for easily recognizing and solving CAPTCHA challenges, useful for vibe coders building AI-powered apps.

13.6K
Active
Python
Computer Vision
#captcha#ocr#computer-vision

tisfeng/Easydict

Easydict is a concise and elegant Dictionary and Translator macOS App for looking up words and translating text.

12.4K
Active
Swift
React
#dictionary#translator#macos

DayBreak-u/chineseocr_lite

A lightweight Chinese OCR library that supports vertical text recognition and NCNN/MNN/TNN inference with a small model size.

12.3K
Archived
C++
Computer Vision
PyTorch
#ocr#computer-vision#ncnn
2...9

Stay in the loop

Get weekly updates on trending AI coding tools and projects.