Explore Projects

Discover 5 open source projects

Active filters (1):
Search: table-extraction

Showing 1-5 of 5 projects

jsvine/pdfplumber

A Python library that provides a powerful API for extracting text and tables from PDF files.

9.8K
Active
Python
API Frameworks
#pdf #pdf-parsing #table-extraction

pymupdf/PyMuPDF

A high-performance Python library for data extraction, analysis, conversion and manipulation of PDF and other documents.

9.2K
Active
Python
Document Processing
#pdf #data-extraction #text-processing

kreuzberg-dev/kreuzberg

A polyglot document intelligence framework with a Rust core for extracting text, metadata, and structured information from various file formats.

6.6K
Active
HTML
API Clients & Testing
API Documentation
#document-intelligence #metadata-extraction #pdf-extraction

microsoft/table-transformer

Deep learning model for extracting and analyzing table structures from PDFs and images, with accompanying datasets.

2.9K
Archived
Python
Computer Vision
ETL & Pipelines
PyTorch
#table-extraction #computer-vision #document-processing

NanoNets/docext

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit.

1.9K
Stable
Python
Computer Vision
API Frameworks
#document-analysis #document-data-extraction #ocr-benchmark
