Explore Projects

Discover 6 open source projects

Active filters (1):
Search: layout-analysis

Showing 1-6 of 6 projects

opendatalab/MinerU

Converts complex documents into LLM-ready formats for agentic workflows.

55.5K
Active
Python
Agents & Orchestration
Agent Coordination
Python
#document-analysis #pdf-extraction #llm-workflows

bytedance/Dolphin

Dolphin is a document image parsing library that uses heterogeneous anchor prompting for OCR and layout analysis.

8.9K
Stable
Python
Computer Vision
API Frameworks
Python
#document-analysis #layout-analysis #ocr

Layout-Parser/layout-parser

A unified toolkit for deep learning-based document image analysis and layout parsing.

5.7K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#document-processing #layout-analysis #object-detection

breezedeus/Pix2Text

An open-source Python 3 tool that recognizes layouts, tables, math formulas, and text in images and converts them to Markdown.

3.0K
Experimental
Jupyter Notebook
Computer Vision
File Storage
PyTorch
#ocr #latex #math-formula-recognition

UglyToad/PdfPig

A C# library for reading and extracting text and other content from PDF files, ported from the Java PDFBox library.

2.4K
Stable
C#
API Frameworks
Databases
#pdf #pdf-extraction #document-analysis

kotaro-kinoshita/yomitoku

An AI-powered document image analysis package designed specifically for the Japanese language.

1.3K
Active
Python
Computer Vision
API Frameworks
PyTorch
#deep-learning #ocr #layout-analysis
