Explore Projects

Discover 255 open source projects

Active filters (1):
Search: scrape

Showing 201-220 of 255 projects

kkangert/kspider

Kspider is a graphical web scraping and automation platform that allows developers to create scraping flows without coding.

1.3K
Archived
Java
Backend Frameworks
CLI Tools
#web-scraping #automation #no-code

Altimis/Scweet

Scweet is a powerful Python library for scraping tweets, followers, following, likes, and other Twitter data.

1.3K
Experimental
Python
API Clients & Testing
Backend Frameworks
#twitter #scraping #data-extraction

raznem/parsera

Lightweight Python library for scraping websites using large language models (LLMs) and the Playwright browser automation tool.

1.3K
Stable
Python
LLM Frameworks
Backend Frameworks
#ai #scraping #data-extraction

tinyfish-io/agentql

AgentQL is a suite of tools for connecting your AI to the web, featuring a query language and Playwright integrations for web automation and data extraction.

1.3K
Active
Python
Agents & Orchestration
API Clients & Testing
Playwright
#web-automation #web-scraping #playwright-integration

rebrowser/rebrowser-patches

Collection of patches to avoid automation detection and captchas for web scraping and crawling tools.

1.3K
Experimental
JavaScript
Backend & APIs
CLI Tools
Puppeteer
#automation #web-scraping #headless

submato/xhscrawl

A web scraping tool for collecting data from Xiaohongshu, Bilibili, and other Chinese social platforms.

1.3K
Experimental
Web Scraping
API Frameworks
#web-scraping #social-media-data #api

ityard/python-fxxk-spider

A Python crawler project for scraping free resources.

1.3K
Archived
Scrapy
#crawler #python #requests

0x676e67/rnet

An ergonomic Python HTTP client with TLS fingerprinting capabilities for web scraping and crawling.

1.2K
Active
Rust
API Clients & Testing
Backend Frameworks
#http #https #tls

c4tcom/Katana

A Python tool that enables advanced Google searches and web scraping using Google Dorks.

1.2K
Experimental
Python
CLI Tools
Backend Frameworks
#web-scraping #google-dorks #security-research

bookstairs/bookhunter

A Go-based download tool for scraping ebooks from the internet.

1.2K
Stable
Go
API Frameworks
Backend Frameworks
#web-scraping #ebook-downloader #go-lang

syrusakbary/gdom

A Python library for DOM traversal and scraping using GraphQL queries.

1.2K
Archived
Python
GraphQL
Backend Frameworks
#graphql #web-scraping #dom-manipulation

stanfordjournalism/search-script-scrape

A collection of 101 real-world web scraping exercises in Python 3 for data journalists.

1.2K
Archived
Python
Backend Frameworks
ETL & Pipelines
#web-scraping #data-journalism #python-3

firecrawl/open-scouts

An open-source, AI-powered web monitoring platform that helps developers automate web searches and email alerts.

1.2K
Active
TypeScript
Agents & Orchestration
Email & Notifications
Next.js
#ai-agents #web-monitoring #email-alerts

cameron/squirt

A fast, extensible library for scraping and parsing web content, enabling speed reading experiences.

1.2K
Archived
JavaScript
Frontend Frameworks
API Frameworks
React
#web-scraping #parsing #speed-reading

istresearch/scrapy-cluster

A distributed, on-demand web scraping solution using Scrapy, Redis, and Kafka for high-performance crawling.

1.2K
Archived
Python
API Frameworks
Caching
Scrapy
#distributed-computing #web-scraping #high-performance

lit26/finvizfinance

A Python library for financial analysis and data scraping from the Finviz platform.

1.2K
Active
Jupyter Notebook
ETL & Pipelines
Backend Frameworks
#financial-analysis #web-scraping #data-pipeline

monosans/proxy-scraper-checker

A Rust-based library to scrape and check the availability of HTTP, SOCKS4, and SOCKS5 proxies.

1.2K
Active
Rust
API Frameworks
CLI Tools
#proxy-scraper #proxy-checker #proxy-list

zhentaoo/puppeteer-deep

A powerful Puppeteer-based library for web scraping, automation, and performance analysis, focused on developers building with AI tools.

1.2K
Archived
JavaScript
Backend Frameworks
CLI Tools
Node
#web-scraping #automation #performance-analysis

Minerchu/dongguaTV

An open-source media server platform that leverages AI and web scraping to provide a personalized Netflix-like streaming experience.

1.2K
Stable
HTML
CMS & Content
Search-as-a-Service
#media-server #streaming #web-scraping

mintapi/mintapi

An unofficial screen-scraping API for Mint.com, a popular personal finance management platform.

1.2K
Archived
Python
API Clients & Testing
API Frameworks
#finance #personal-finance #screen-scraping