Explore Projects

Discover 170 open source projects

Active filters (1):
Search: crawler

Showing 21-40 of 170 projects

code4craft/webmagic

A scalable web crawler framework for Java developers to build custom web scrapers and data extraction tools.

11.7K
Stable
Java
API Frameworks
#crawler #scraping #framework

ssssssss-team/spider-flow

A no-code web crawler platform that allows developers to define crawling workflows visually without writing code.

11.3K
Archived
Java
Web Development
Java
#crawler #jsoup #spider

injetlee/Python

Python scripts for web scraping, automating common tasks like login, Excel manipulation, and WeChat operations.

10.5K
Archived
Python
API Frameworks
#web-scraping #automation #excel

faisalman/ua-parser-js

A powerful user-agent detection library for client-side and server-side web development.

10.1K
Active
JavaScript
Frontend Frameworks
React
#analytics #bot-detection #browser-detection

guyueyingmu/avbook

An adult video management system with a web crawler, database, and magnet link library for Japanese adult videos.

9.9K
Archived
PHP
API Frameworks
Laravel
#adult-video #crawler #database

shmilylty/OneForAll

OneForAll is a powerful subdomain collection tool for security researchers and bug bounty hunters.

9.7K
Stable
Python
Penetration Testing
Python
#subdomain-enumeration #osint #pentest-tool

apify/crawlee-python

Crawlee is a powerful web scraping and browser automation library for Python to build reliable crawlers.

8.2K
Active
Python
API Clients & Testing
Backend Frameworks
Playwright
#web-scraping #crawling #automation

TeamWiseFlow/wiseflow

A Python-based platform for developers that uses LLMs to track and extract information from websites, RSS feeds, and social media.

8.1K
Active
Python
LLM Frameworks
Backend Frameworks
Python
#crawler #information-gathering #information-tracker

lorien/awesome-web-scraping

A comprehensive list of libraries, tools, and APIs for web scraping and data processing.

7.8K
Active
Makefile
Backend Frameworks
ETL & Pipelines
#web-scraping #crawling #data-processing

andeya/pholcus

Pholcus is a high-concurrency web crawler written in Go for developers who need a powerful, distributed crawling solution.

7.6K
Archived
Go
API Frameworks
CLI Tools
#crawler #spider #distributed

BruceDone/awesome-crawler

A comprehensive collection of web crawlers and scrapers in various programming languages.

7.1K
Archived
Backend Frameworks
CLI Tools
#web-crawler #web-scraper #scraper

alirezamika/autoscraper

A powerful, lightweight web scraping library for Python that can automate data extraction from websites.

7.1K
Experimental
Python
Backend & APIs
CLI Tools
Python
#web-scraping #automation #data-extraction

adithya-s-k/omniparse

A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.

6.8K
Stable
Python
LLM Frameworks
File Storage
Python
#ingestion-api #ocr #parser-library

bda-research/node-crawler

A Node.js-based web crawler/spider for extracting data from websites using cheerio and jQuery.

6.8K
Experimental
TypeScript
React
#crawler #data-extraction #javascript

autoscrape-labs/pydoll

Pydoll is a Python library for automating Chromium-based browsers without a WebDriver, offering realistic interactions.

6.6K
Active
Python
Frontend Frameworks
API Frameworks
#automation #browser-automation #scraping

subzeroid/instagrapi

A fast and powerful Python library for interacting with the Instagram Private API, including features like automation and scraping.

5.9K
Active
Python
API Clients & Testing
Backend Frameworks
Python
#instagram #api-wrapper #automation

MontFerret/ferret

A declarative web scraping library written in Go, providing a powerful DSL for extracting data from websites.

5.9K
Stable
Go
Backend Frameworks
CLI Tools
#web-scraping #crawler #data-mining

00-Evan/shattered-pixel-dungeon

Shattered Pixel Dungeon is an open-source traditional roguelike dungeon crawler for Android and iOS.

5.9K
Active
Java
Android
iOS
LibGDX
#android #ios #roguelike

yujiosaka/headless-chrome-crawler

A powerful, distributed web crawler powered by Headless Chrome for scraping websites at scale.

5.7K
Archived
JavaScript
Backend Frameworks
CLI Tools
Node.js
#web-crawler #headless-chrome #scraper

firecrawl/firecrawl-mcp-server

Firecrawl MCP Server adds powerful web scraping and search capabilities to AI clients such as Cursor and Claude.

5.7K
Active
JavaScript
MCP Servers
LLM Wrappers & SDKs
JavaScript
#web-scraping #search-api #llm-integration