Showing 81-100 of 170 projects
An AI-powered web scraping and data-gathering SDK for building intelligent agents and LLM apps.
A lightweight, easy-to-use web crawler written in Java.
A powerful web scraping framework for Python that supports asynchronous crawling and flexible data extraction.
Daily arXiv paper crawler with AI summaries & GitHub Pages visualization for research discovery.
A collection of leaked GPT prompts and tools to bypass subscription limits and try out AI models.
A Python library for building web crawlers and spiders, suitable for vibe coders interested in web automation.
news-please is an integrated web crawler and information extractor for news that works out of the box.
A Java-based stock information crawler for the XueQiu platform.
A PHP library for detecting bots, crawlers, and spiders based on the user agent string.
A powerful Python-based web scraper for extracting data from Weibo, a popular Chinese social media platform.
A cross-platform, fast, and flexible C# web crawler framework for developers building crawlers and spiders.
A powerful web scraping and crawling library for Rust developers.
Flexible event-driven web crawler for Node.js, useful for building custom web scraping solutions.
A powerful Go-based website cloning tool that can clone websites to your computer within seconds.
A distributed, agile Java-based web crawler framework that can be used in SpringBoot applications.
Rendora is a dynamic server-side rendering solution using headless Chrome for SEO optimization.
Real-time COVID-19 infection data crawler and API for developers tracking the pandemic.
A Python tool that finds web directories without brute-forcing, useful for security researchers and penetration testers.
A media downloader for various social and adult platforms, including Twitter, Reddit, and OnlyFans.
This repository provides a systematic guide for learning how to build Python web crawlers.