Explore Projects

Discover 53 open source projects

Active filters (1):
Search: web-scraping

Showing 41-53 of 53 projects

oxylabs/how-to-scrape-google-scholar

A Python library for extracting titles, authors, and citations from Google Scholar using web scraping.

1.4K
Stable
Python
Backend & APIs
CLI Tools
Python
#google-scholar #web-scraping #web-crawler

tinyfish-io/agentql

AgentQL is a suite of tools for connecting your AI to the web, featuring a query language and Playwright integrations for web automation and data extraction.

1.3K
Active
Python
Agents & Orchestration
API Clients & Testing
Playwright
#web-automation #web-scraping #playwright-integration

rebrowser/rebrowser-patches

Collection of patches to avoid automation detection and captchas for web scraping and crawling tools.

1.3K
Experimental
JavaScript
Backend & APIs
CLI Tools
Puppeteer
#automation #web-scraping #headless

0x676e67/rnet

An ergonomic Python HTTP client with TLS fingerprinting capabilities for web scraping and crawling.

1.2K
Active
Rust
API Clients & Testing
Backend Frameworks
Rust
#http #https #tls

firecrawl/open-scouts

An open-source, AI-powered web monitoring platform that helps developers automate web searches and email alerts.

1.2K
Active
TypeScript
Agents & Orchestration
Email & Notifications
Next.js
#ai-agents #web-monitoring #email-alerts

lit26/finvizfinance

A Python library for financial analysis and data scraping from the Finviz platform.

1.2K
Active
Jupyter Notebook
ETL & Pipelines
Backend Frameworks
Jupyter Notebook
#financial-analysis #web-scraping #data-pipeline

gildas-lormeau/single-file-cli

CLI tool for saving a complete web page as a single HTML file, useful for web archiving and scraping.

1.2K
Stable
JavaScript
CLI Tools
Node.js
#web-scraping #web-archiving #cli

Decodo/Decodo

Residential proxy server with HTTP(S)/SOCKS5 rotation capabilities for web scraping and data collection.

1.2K
Stable
Java
API Clients & Testing
Caching
#web-scraping #data-collection #http-proxy

Kaliiiiiiiiii-Vinyzu/patchright-python

A stealthy Python implementation of the Playwright testing and automation library for web automation and scraping.

1.2K
Active
Python
Backend Frameworks
Testing
#automation #web-scraping #undetectable

intoli/user-agents

A JavaScript library for generating random user agents with daily updated data, useful for web scraping and browser automation.

1.1K
Stable
TypeScript
Backend & APIs
CLI Tools
JavaScript
#browser-automation #user-agent #web-scraping

juancarlospaco/faster-than-requests

A high-performance Python library for making HTTP requests, designed to be faster than the requests library.

1.1K
Active
Nim
Backend & APIs
CLI Tools
#high-performance #web-scraping #http-requests

oxylabs/how-to-scrape-google-jobs

A Python library to scrape Google Jobs listings for multiple search queries and locations.

1.1K
Stable
Python
API Clients & Testing
Backend Frameworks
Python
#google-scraping #google-search-api #serp-scraping

vifreefly/kimuraframework

A modern Ruby web scraping framework that can interact with JavaScript-rendered websites using headless browsers.

1.1K
Active
Ruby
Backend Frameworks
CLI Tools
#web-scraping #headless-browser #ruby
