Showing 1-17 of 17 projects
Convert websites into LLM-ready data with API for scraping, crawling, and structured data extraction
AI-powered web scraping library for extracting data from websites and documents
Web scraping and browser automation library for Node.js
A distributed web crawler admin platform for managing spiders in any language or framework.
A no-code web crawler platform that allows developers to define crawling workflows visually without writing code.
Crawlee is a powerful web scraping and browser automation library for Python to build reliable crawlers.
A comprehensive collection of web crawlers and scrapers in various programming languages.
A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.
Firecrawl MCP Server adds powerful web scraping and search capabilities to AI language models like Cursor and Claude.
A comprehensive toolkit for information gathering and reconnaissance, including OSINT, web crawling, and more.
Apache Nutch is an extensible and scalable web crawler for building search engines and data mining applications.
A cross-platform, fast, and flexible C# web crawler framework for developers building crawlers and spiders.
A simple and easy-to-use Python web scraping framework with support for multi-threading and proxies.
An internet search engine focused on indexing the small, old, and weird parts of the web.
CLI tool for saving a complete web page as a single HTML file, useful for web archiving and scraping.
A TypeScript-based tool for finding and fixing broken links in websites, documentation, and local files.
A TypeScript library to detect bots, crawlers, and spiders based on their user agent string.
Get weekly updates on trending AI coding tools and projects.