Showing 21-40 of 96 projects
A declarative web scraping library written in Go, providing a powerful DSL for extracting data from websites.
A powerful, distributed web crawler powered by Headless Chrome for scraping websites at scale.
A Redis-based distributed scraping library for the Scrapy web crawling framework.
A high-availability distributed IP proxy pool powered by Scrapy and Redis for web crawling applications.
Gathers text and metadata from the web using crawling, scraping, and extraction techniques.
Python API for crawling and downloading content from the JMComic website, a popular source for adult manga/comics.
A fast, simple web crawler designed for quick discovery of endpoints and assets within a web application.
A Go-based tool to fetch known URLs from various threat intelligence sources for security analysis.
A content management system for novels, with features such as recommendations, search, and reading.
An archived repository that provides a web crawler and search engine for the now-defunct Wooyun security vulnerability database.
A powerful Python framework for building undetectable web scrapers.
A TypeScript-based server for web search and web crawling, part of the Exa MCP ecosystem.
A headless Chrome .NET API for web automation, crawling, and end-to-end testing.
DedSecInside/TorBot is a dark web OSINT tool written in Python that crawls and extracts information from the Tor network.
A list of AI agents and robots to block, useful for privacy-conscious developers.
A comprehensive toolkit for information gathering and reconnaissance, including OSINT, web crawling, and more.
Apache Nutch is an extensible and scalable web crawler for building search engines and data mining applications.
An open-source web crawler framework written in Java that makes it easy to build multi-threaded web crawlers.
The official repository for the classic roguelike game Dungeon Crawl: Stone Soup.
Geziyor is a fast web crawling and scraping framework for Go that supports JavaScript rendering.