A Redis-based distributed scraping library for the Scrapy web crawling framework.
A high-availability distributed IP proxy pool powered by Scrapy and Redis for web crawling applications.
Gathers text and metadata from the web using crawling, scraping, and extraction techniques.
A collection of Python-based web crawlers for scraping data from various e-commerce and online platforms.
Python API for crawling and downloading content from the JMComic website, a popular source for adult manga/comics.
A fast, simple web crawler designed for quick discovery of endpoints and assets within a web application.
A collection of interesting Python web scraping and data analysis projects.
Analysis of bot protection systems and techniques to bypass browser fingerprinting for web scraping.
A distributed web crawler for Weibo, built using Celery and Requests.
A repository that collects news, resources, and legal regulations related to web crawlers in China.
A community-driven platform to read and chat with AI bots powered by ChatGPT for developers.
An archived repository that provides a web crawler and search engine for the now-defunct Wooyun security vulnerability database.
A Python-based web crawler that can scrape data, images, and videos from Weibo, a popular social media platform in China.
Arachni is a powerful open-source web application security scanner framework for penetration testing and vulnerability detection.
A JavaScript crawler that downloads comics, novels, and webcomics from various online sources.
A self-hosted BitTorrent indexer, crawler, classifier, and search engine with a web UI and API.
A headless Chrome .NET API for web automation, crawling, and end-to-end testing.
DedSecInside/TorBot is a dark web OSINT tool written in Python that crawls and extracts information from the Tor network.
A Python crawler tutorial for beginner, intermediate, and advanced users.
A list of AI agents and robots to block, useful for privacy-conscious developers.