Explore Projects

Discover 255 open source projects

Active filters (1):
Search: scraping×
Clear all

Showing 41-60 of 255 projects

go-rod/rod

A Go library for automating and scraping websites using the Chrome DevTools Protocol.

6.8K
Stable
Go
Backend Frameworks
Testing
#automation#web-scraping#chrome-devtools

autoscrape-labs/pydoll

Pydoll is a Python library for automating chromium-based browsers without a WebDriver, offering realistic interactions.

6.6K
Active
Python
Frontend Frameworks
API Frameworks
#automation#browser-automation#scraping

xiangyuecn/AreaCity-JsSpider-StatsGov

Comprehensive collection of city and administrative region data for China, with features like CSV export, JS code generation, and web scraping.

6.4K
Stable
JavaScript
Databases
CLI Tools
JavaScript
#city-data#administrative-regions#scraping

subzeroid/instagrapi

A fast and powerful Python library for interacting with the Instagram Private API, including features like automation and scraping.

5.9K
Active
Python
API Clients & Testing
Backend Frameworks
Python
#instagram#api-wrapper#automation

MontFerret/ferret

Declarative web scraping library written in Go, providing a powerful DSL for extracting data from websites.

5.9K
Stable
Go
Backend Frameworks
CLI Tools
#web-scraping#crawler#data-mining

matthewmueller/x-ray

A versatile and powerful web scraping library for JavaScript, designed to help developers extract data from the web with ease.

5.9K
Active
JavaScript
Frontend Frameworks
API Frameworks
Node.js
#web-scraping#data-extraction#crawling

xchaoinfo/fuck-login

A Python library that provides a simple way to log in to various websites, enabling web scraping.

5.9K
Archived
Python
API Frameworks
Backend Frameworks
Python
#login#scraping#web-automation

daijro/camoufox

An anti-detect browser library built in C++ for web scraping and automation tasks.

5.8K
Active
C++
Frontend Frameworks
API Frameworks
Playwright
#antidetect#fingerprint#scraping

yujiosaka/headless-chrome-crawler

A powerful, distributed web crawler powered by Headless Chrome for scraping websites at scale.

5.7K
Archived
JavaScript
Backend Frameworks
CLI Tools
Node.js
#web-crawler#headless-chrome#scraper

firecrawl/firecrawl-mcp-server

Firecrawl MCP Server adds powerful web scraping and search capabilities to AI language models like Cursor and Claude.

5.7K
Active
JavaScript
MCP Servers
LLM Wrappers & SDKs
JavaScript
#web-scraping#search-api#llm-integration

rmax/scrapy-redis

A Redis-based distributed scraping library for the Scrapy web crawling framework.

5.6K
Archived
Python
API Frameworks
Caching
Scrapy
#crawler#distributed#redis

adbar/trafilatura

Gathers text and metadata from the web using crawling, scraping, and extraction techniques.

5.4K
Stable
Python
React
#web-scraping#text-extraction#metadata-gathering

DropsDevopsOrg/ECommerceCrawlers

A collection of Python-based web crawlers for scraping data from various e-commerce and online platforms.

5.4K
Archived
Python
Backend Frameworks
ETL & Pipelines
Scrapy
#web-scraping#data-extraction#e-commerce

JustAnotherArchivist/snscrape

A Python library for scraping data from various social media platforms.

5.3K
Archived
Python
API Clients & Testing
Backend Frameworks
Python
#social-media-scraping#data-extraction#api-client

google/gumbo-parser

A pure C99 library for parsing HTML5 documents, useful for web scraping and content extraction projects.

5.2K
Active
HTML
Frontend Frameworks
API Frameworks
React
#html-parsing#web-scraping#content-extraction

spatie/browsershot

A PHP library that allows you to convert HTML to an image, PDF, or string for web scraping and automation.

5.2K
Active
PHP
Backend Frameworks
CLI Tools
#web-scraping#automation#pdf

lexiforest/curl_cffi

A Python library that can impersonate browser fingerprints for web scraping and HTTP requests.

5.1K
Active
Python
Backend & APIs
CLI Tools
Python
#web-scraping#http-client#fingerprinting

Alfred1984/interesting-python

This GitHub repository contains a collection of interesting Python web scraping and data analysis projects.

5.0K
Archived
Jupyter Notebook
Backend Frameworks
ETL & Pipelines
#web-scraping#data-analysis#python

niespodd/browser-fingerprinting

Analysis of bot protection systems and techniques to bypass browser fingerprinting for web scraping.

5.0K
Archived
JavaScript
Security Research
Authentication
Node.js
#bot-detection#browser-fingerprinting#web-scraping

pinchtab/pinchtab

High-performance Go orchestrator for headless Chrome automation with stealth injection and real-time dashboard.

4.9K
Active
Go
Browser Automation SDKs
Testing
Chrome DevTools Protocol
#go-orchestrator#cdp-bridge#stealth-injection
124...13

Stay in the loop

Get weekly updates on trending AI coding tools and projects.