Showing 1-20 of 30 projects
PaddleOCR converts documents and images into structured data for AI applications.
Converts complex documents into LLM-ready formats for agentic workflows.
A Python tool for extracting hard-coded subtitles from videos and generating SRT files using deep learning-based OCR.
A YouTube Music client for Android with a modern Material Design UI and features like NewPipe integration.
A free and open-source file archiver and compression tool with support for various archive formats.
A monorepo for a set of tools developed by the Rush Stack community for TypeScript-based projects.
Gathers text and metadata from the web using crawling, scraping, and extraction techniques.
A Python-based AI news extractor (beta version).
A Python module for automatic summarization of text documents and HTML pages.
A Python/C++ visual SLAM pipeline for 3D reconstruction.
Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.
A Java library for extracting metadata from various media file formats, including images, videos, and audio.
An Android backup extractor tool written in Java for developers working with Android devices.
news-please is an integrated web crawler and information extractor for news that works out of the box.
A C# library for reading and extracting text and other content from PDF files, ported from the Java PDFBox library.
A Go-based tool for processing images, with features such as color palette extraction, OCR, and upscaling.
A Node.js library for extracting the main article content from a given URL using the Readability algorithm.
A Java library for extracting data from various streaming platforms like YouTube, SoundCloud, and Bandcamp.
An AI-powered tool that extracts knowledge and generates summaries from PDF books, page by page.
A Scala library for extracting HTML content and articles from web pages.