Explore Projects

Discover 8 open source projects

Active filters (1):
Search: document-parsingร—
Clear all

Showing 1-8 of 8 projects

PaddlePaddle/PaddleOCR

PaddleOCR converts documents/images to structured data for AI apps

71.6K
Active
Python
Computer Vision
MCP Servers
PaddlePaddle
#ocr#document-parsing#ai4science

docling-project/docling

Converts documents to AI-ready formats with advanced parsing

55.0K
Active
Python
Computer Vision
CLI Tools
#document-parsing#pdf-converter#ocr

Unstructured-IO/unstructured

Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.

14.1K
Active
HTML
Document Processing
#document-processing#data-pipelines#natural-language-processing

run-llama/llama_cloud_services

A set of TypeScript-based cloud services and utilities for processing and extracting structured data from various document formats.

4.2K
Active
TypeScript
File Storage
Caching
TypeScript
#document-parsing#pdf-processing#structured-data

opendataloader-project/opendataloader-pdf

Fast local PDF-to-Markdown/JSON converter for RAG pipelines. No GPU needed.

1.8K
Active
Java
RAG Frameworks
RAG & Vector
Java
#pdf-parser#rag-pipeline#markdown-conversion

enoch3712/ExtractThinker

ExtractThinker is a powerful document intelligence library for LLMs, offering flexible and intuitive workflows.

1.5K
Stable
Python
LLM Frameworks
ORMs & Query Builders
Python
#document-intelligence#llm#ocr

NanoNets/docstrange

An intelligent document parsing tool that extracts and converts data from various document formats to structured data like Markdown, JSON, CSV, and HTML.

1.4K
Stable
Python
LLM Wrappers & SDKs
API Frameworks
Python
#ocr#pdf-parser#document-parsing

Topdu/OpenOCR

An open-source toolkit for general OCR research and applications, with integrated training, evaluation, and production-ready OCR systems.

1.3K
Active
Python
Computer Vision
Backend Frameworks
PyTorch
#ocr#document-processing#computer-vision

Stay in the loop

Get weekly updates on trending AI coding tools and projects.