Explore Projects

Discover 6 open source projects

Active filters (1):

Search: layout-analysis×

Clear all

Showing 1-6 of 6 projects

opendatalab/MinerU

Converts complex documents into LLM-ready formats for agentic workflows

55.5K

Active

Python

Agents & Orchestration

Agent Coordination

Python

#document-analysis#pdf-extraction#llm-workflows

bytedance/Dolphin

Dolphin is a document image parsing library that uses heterogeneous anchor prompting for OCR and layout analysis.

8.9K

Stable

Python

Computer Vision

API Frameworks

Python

#document-analysis#layout-analysis#ocr

Layout-Parser/layout-parser

A unified toolkit for deep learning-based document image analysis and layout parsing.

5.7K

Archived

Python

Computer Vision

API Frameworks

PyTorch

#document-processing#layout-analysis#object-detection

breezedeus/Pix2Text

An open-source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them to Markdown.

3.0K

Experimental

Jupyter Notebook

Computer Vision

File Storage

PyTorch

#ocr#latex#math-formula-recognition

UglyToad/PdfPig

A C# library for reading and extracting text and other content from PDF files, ported from the Java PDFBox library.

2.4K

Stable

API Frameworks

Databases

#pdf#pdf-extraction#document-analysis

kotaro-kinoshita/yomitoku

An AI-powered document image analysis package designed specifically for the Japanese language.

1.3K

Active

Python

Computer Vision

API Frameworks

PyTorch

#deep-learning#ocr#layout-analysis

Stay in the loop

Get weekly updates on trending AI coding tools and projects.