Showing 1-20 of 96 projects
Convert websites into LLM-ready data with API for scraping, crawling, and structured data extraction
Scrapy is a fast, high-level web crawling and scraping framework for Python developers.
Go Colly - Elegant Scraper and Crawler Framework for Golang
DeerFlow is a super agent harness for orchestrating sub-agents, memory, and sandboxes in deep research workflows.
Powerful, flexible Python library for effortless web scraping with AI-powered features.
Crawl websites to create custom GPTs from URLs
Web scraping and browser automation library for Node.js
A highly customizable web crawler and spider framework for developers to build advanced crawling solutions.
A Python library for extracting news articles, full-text, and metadata from websites.
A Python library for crawling historical data of China stocks.
A distributed web crawler admin platform for managing spiders in any language or framework.
A no-code web crawler platform that allows developers to define crawling workflows visually without writing code.
Crawls and scrapes Weibo data using Python.
INFO-SPIDER is an open-source web scraping toolkit that helps users retrieve data from various sources like email, e-commerce, and social platforms.
Crawlee is a powerful web scraping and browser automation library for Python to build reliable crawlers.
A comprehensive list of libraries, tools, and APIs for web scraping and data processing.
Pholcus is a high-concurrency web crawler software written in Go for developers needing a powerful, distributed crawling solution.
Anti-Anti-Spider is a Python library that helps developers bypass anti-crawling measures on websites to collect data.
A Go library for automating and scraping websites using the Chrome DevTools Protocol.
A Python-based platform for aggregating and crawling proxy servers, useful for building proxy-reliant applications.
Get weekly updates on trending AI coding tools and projects.