Explore Projects

Discover 4 open source projects

Active filters (1):
Search: pdf-to-text

Showing 1-4 of 4 projects

docling-project/docling

Converts documents to AI-ready formats with advanced parsing

55.0K
Active
Python
Computer Vision
CLI Tools
#document-parsing #pdf-converter #ocr

Unstructured-IO/unstructured

Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.

14.1K
Active
HTML
Document Processing
#document-processing #data-pipelines #natural-language-processing

run-llama/llama_cloud_services

A set of TypeScript-based cloud services and utilities for processing and extracting structured data from various document formats.

4.2K
Active
TypeScript
File Storage
Caching
#document-parsing #pdf-processing #structured-data

enoch3712/ExtractThinker

ExtractThinker is a powerful document intelligence library for LLMs, offering flexible and intuitive workflows.

1.5K
Stable
Python
LLM Frameworks
ORMs & Query Builders
#document-intelligence #llm #ocr
