Explore Projects

Discover 5 open source projects

Active filters (1):
Search: pdf-to-markdown

Showing 1-5 of 5 projects

run-llama/llama_cloud_services

A set of TypeScript-based cloud services and utilities for processing and extracting structured data from various document formats.

4.2K
Active
TypeScript
File Storage
Caching
#document-parsing #pdf-processing #structured-data

chatdoc-com/OCRFlux

OCRFlux is a powerful PDF-to-Markdown conversion toolkit with advanced layout handling, table parsing, and cross-page content merging.

2.5K
Experimental
Python
Computer Vision
API Frameworks
#pdf-conversion #markdown-generation #layout-handling

opendataloader-project/opendataloader-pdf

Fast local PDF-to-Markdown/JSON converter for RAG pipelines. No GPU needed.

1.8K
Active
Java
RAG Frameworks
RAG & Vector
#pdf-parser #rag-pipeline #markdown-conversion

NanoNets/docstrange

An intelligent document parsing tool that extracts and converts data from various document formats to structured data like Markdown, JSON, CSV, and HTML.

1.4K
Stable
Python
LLM Wrappers & SDKs
API Frameworks
#ocr #pdf-parser #document-parsing

wisupai/e2m

E2M is a flexible, open-source tool that converts various file types to Markdown, making it easy for vibe coders to work with content.

1.3K
Archived
Jupyter Notebook
LLM Frameworks
File Storage
#markdown #text-cleaning #doc2x
