Showing 1-6 of 6 projects
Converts complex documents into LLM-ready formats for agentic workflows
A high-performance Python library for data extraction, analysis, conversion and manipulation of PDF and other documents.
A NodeJS-based web crawler/spider for extracting data from websites using cheerio and jQuery.
Meltano is a declarative, code-first data integration engine for building and scaling data and ML-powered products.
Open-source platform for extracting structured data from documents using AI.
Crawly is a high-level web crawling and scraping framework for Elixir, enabling developers to extract data from websites efficiently.
Get weekly updates on trending AI coding tools and projects.