OCR and document extraction using vision models.
A powerful app for advanced image manipulation with dozens of features.
A privacy-focused PDF toolkit for developers, providing a range of PDF-related tools and utilities.
An open LLM DevOps platform for building next-gen enterprise AI applications, with features such as GenAI workflows, RAG, agents, and model management.
An entry-level project for face, video, and text detection and recognition using Python and popular AI/ML libraries.
A C++ library for translating visual novels and galgames through OCR and reverse-engineering techniques.
Automatically convert documentation, GitHub repos, and PDFs into Claude AI skills with conflict detection.
Bob is a macOS app that provides translation and OCR capabilities for developers who work with AI tools.
An open-source project that uses deep learning and OCR to translate text in manga and other images.
A high-performance Python library for data extraction, analysis, conversion and manipulation of PDF and other documents.
Dolphin is a document image parsing library that uses heterogeneous anchor prompting for OCR and layout analysis.
A Python tool for extracting hard-coded subtitles from videos and generating SRT files using deep learning-based OCR.
Effortless data labeling with AI support from Segment Anything and other powerful models.
Open-source implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model.
A Python-based document management system for scanning, indexing, and archiving paper documents.
Provides AI-powered tools and samples for developers to build cutting-edge applications with Microsoft AI.
A collection of trained models for the Tesseract OCR engine, a powerful open-source optical character recognition tool.
A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.
Donut is an OCR-free Document Understanding Transformer and Synthetic Document Generator for computer vision and document AI tasks.
pytesseract is a Python wrapper for Google's Tesseract, a popular optical character recognition (OCR) engine.