Explore Projects

Discover 353 open source projects

Active filters (1):
Search: pdf×
Clear all

Showing 141-160 of 353 projects

FastReports/FastReport

A free open-source reporting tool for .NET that helps generate document-like reports in applications.

3.0K
Stable
C#
API Frameworks
Backend Frameworks
.NET
#reporting#pdf-generation#dotnet

CatchTheTornado/text-extract-api

An API for extracting, anonymizing, and parsing text from various document formats using state-of-the-art OCR and LLM models.

3.0K
Stable
Python
LLM Wrappers & SDKs
API Clients & Testing
Python
#anonymization#ocr#pdf

mynane/PDF

A collection of various resources for developers, including AI coding tools and utilities.

2.9K
Archived
JavaScript
AI Code Editors
JavaScript
#ai-tools#developer-resources#utilities

pdfkit/pdfkit

A Ruby gem that transforms HTML and CSS into PDFs using the wkhtmltopdf command-line utility.

2.9K
Archived
Ruby
Backend Frameworks
CLI Tools
#html-to-pdf#pdf-generation#ruby-gem

MashiroSaber03/Saber-Translator

AI-powered manga translator with OCR & bubble detection for Japanese comics to Chinese

2.9K
Active
Python
Computer Vision
LLM Wrappers & SDKs
Python
#manga-translation#ocr#ai-detection

ciur/papermerge

Self-hosted document management system with OCR for scanning and archiving papers digitally.

2.9K
Stable
Python
API Frameworks
ETL & Pipelines
Django
#document-management-system#ocr-scanning#paperless

cirosantilli/china-dictatorship

Political activism documentation on Chinese government censorship, human rights, and censorship circumvention techniques.

2.9K
Active
HTML
Resource Collections
Privacy Tools
#censorship-circumvention#china-dictatorship#human-rights

Xmader/musescore-downloader

A tool to download sheet music from Musescore.com for free, without login or Musescore Pro.

2.8K
Archived
TypeScript
Backend Frameworks
General Utilities
TypeScript
#musescore#sheet-music#download

barryvdh/laravel-snappy

Laravel Snappy PDF is a Laravel package that provides a convenient wrapper for the wkhtmltopdf library, allowing you to generate PDFs from HTML.

2.8K
Experimental
PHP
API Frameworks
Backend Frameworks
Laravel
#pdf#html-to-pdf#laravel

ArtifexSoftware/mupdf

mupdf is a lightweight PDF and XPS viewer and toolkit written in C.

2.6K
Active
C
API Frameworks
CLI Tools
#pdf#xps#viewer

vsch/flexmark-java

A powerful Java library for parsing CommonMark/Markdown with advanced features like HTML to Markdown conversion and PDF/DOCX export.

2.6K
Experimental
Java
Backend Frameworks
API Clients & Testing
Java
#markdown#commonmark#html-to-markdown

zhoubear/open-paperless

An open-source document management system that allows you to scan, index, and archive paper documents.

2.6K
Archived
Python
API Frameworks
Databases
#document-management#ocr#paperless

OnedocLabs/react-print-pdf

A React library for building and generating PDF documents with simple, reusable components and templates.

2.5K
Archived
TypeScript
Component Libraries (React)
File Storage
React
#pdf#pdf-generator#pdf-manipulation

simonbengtsson/jsPDF-AutoTable

A TypeScript plugin for jsPDF to generate PDF tables with JavaScript.

2.5K
Active
TypeScript
Component Libraries (React)
Full-Stack Frameworks
JavaScript
#jspdf#pdf#tables

chatdoc-com/OCRFlux

OCRFlux is a powerful PDF-to-Markdown conversion toolkit with advanced layout handling, table parsing, and cross-page content merging.

2.5K
Experimental
Python
Computer Vision
API Frameworks
Python
#pdf-conversion#markdown-generation#layout-handling

tefkah/zotero-night

A night theme for the Zotero reference management software UI and PDF viewer.

2.5K
Active
SCSS
Component Libraries (React)
Animation & Motion
#zotero#theme#ui

Anil-matcha/Open-Higgsfield-AI

A chatbot that allows you to chat with and extract information from PDF documents using language models and AI.

2.5K
Archived
Jupyter Notebook
LLM Frameworks
Agents & Orchestration
Jupyter Notebook
#chatbot#chatgpt#pdf

ogkalu2/comic-translate

A Python desktop app for automatically translating comics in various formats and languages using computer vision and machine translation.

2.4K
Active
Python
Computer Vision
Neural Network
PySide6
#comics#manga#translation

openpaperwork/paperwork

A personal document manager for Linux and Windows, focused on indexing, OCR, and managing PDF files.

2.4K
Archived
Python
CLI Tools
Authentication
Python
#document-management#ocr#pdf

UglyToad/PdfPig

A C# library for reading and extracting text and other content from PDF files, ported from the Java PDFBox library.

2.4K
Stable
C#
API Frameworks
Databases
#pdf#pdf-extraction#document-analysis
1...79...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.