Explore Projects

Discover 255 open source projects

Active filters (1):
Search: scraping

Showing 141-160 of 255 projects

howie6879/ruia

An async Python 3.6+ web scraping micro-framework based on asyncio for building high-performance crawlers and spiders.

1.7K
Archived
Python
Backend Frameworks
CLI Tools
Python
#web-scraping #asynchronous #asyncio

librauee/Reptile

A comprehensive Python web scraping library covering a wide range of popular websites and platforms.

1.7K
Archived
Python
Backend Frameworks
CLI Tools
#web-scraping #python3 #requests

TheWebScrapingClub/webscraping-from-0-to-hero

A comprehensive resource for learning web scraping with Python, covering tools like Playwright, Scrapy, and Splash.

1.7K
Archived
Backend Frameworks
CLI Tools
Python
#web-scraping #python #playwright

website-scraper/node-website-scraper

A Node.js library for scraping websites and downloading their contents locally.

1.7K
Active
JavaScript
Backend Frameworks
CLI Tools
Node.js
#scraper #website-downloader #node.js

Python3Spiders/WeiboSuperSpider

A powerful Python-based web crawler and toolkit for scraping Weibo data, including user profiles, comments, images, and more.

1.7K
Archived
Python
Backend Frameworks
Databases
Python
#weibo #web-scraping #data-extraction

claffin/cloudproxy

A Python library that provisions proxy servers across cloud providers to improve web scraping success.

1.7K
Active
Python
API Clients & Testing
Containerization
#web-scraping #cloud-infrastructure #proxy-server

aivarsk/scrapy-proxies

A random proxy middleware for the Scrapy web scraping framework in Python.

1.7K
Archived
Python
Backend & APIs
CLI Tools
Scrapy
#web-scraping #proxies #middleware

Ge0rg3/requests-ip-rotator

A Python library that uses AWS API Gateway's IP pool as a proxy for web scraping and security testing.

1.6K
Experimental
Python
API Clients & Testing
Security Research
#api-gateway #aws #web-scraping

apurvsinghgautam/dark-web-osint-tools

A collection of OSINT tools for exploring the dark web, including scraping, search, and data extraction capabilities.

1.6K
Experimental
Security Research
Backend Frameworks
#osint #darkweb #scraping

scrapy/dirbot

A deprecated Python-based Scrapy project for scraping public web directories, intended for educational use.

1.6K
Archived
Python
Backend Frameworks
CLI Tools
#web-scraping #educational #public-directories

th3unkn0n/TeleGram-Scraper

A Python tool for scraping information from Telegram groups, including group member details.

1.6K
Archived
Python
CLI Tools
Information Retrieval
#telegram #scraper #information-gathering

megadose/OnionSearch

OnionSearch is a Python script that scrapes URLs from different .onion search engines for open-source intelligence.

1.6K
Archived
Python
OSINT Tools
CLI Tools
Python
#open-source-intelligence #onion-search #web-scraping

justmarkham/DAT8

General Assembly's 2015 Data Science course covering topics like machine learning, data analysis, and data visualization.

1.6K
Archived
Jupyter Notebook
Tutorials & Courses
Jupyter Notebook
#data-analysis #data-science #machine-learning

saermart/DouyinLiveWebFetcher

A Python library for scraping real-time data from Douyin (the Chinese counterpart of TikTok) live streams, including comments and metadata.

1.6K
Stable
Python
API Frameworks
Backend Frameworks
#web-scraping #live-streaming #comments

probberechts/soccerdata

A Python library for scraping soccer data from various sources for sports analytics and data science.

1.6K
Active
Python
ETL & Pipelines
CLI Tools
#soccer-analytics #data-scraping #sports-data

paulpierre/informer

A Telegram bot for mass surveillance and data scraping, built with Python.

1.6K
Stable
Python
API Frameworks
CLI Tools
Python
#bot #scraper #surveillance

srx-2000/spider_collection

A collection of Python web scraping scripts for various websites and platforms, including music, video, and real estate data.

1.6K
Archived
Python
Backend Frameworks
ETL & Pipelines
#web-scraping #data-extraction #python-scripts

rushter/selectolax

A fast HTML5 parser with CSS selectors for Python, useful for web scraping and crawling tasks.

1.6K
Active
Cython
Frontend Frameworks
API Frameworks
Python
#web-scraping #html-parsing #css-selectors

johntitus/node-horseman

Run PhantomJS, a headless web browser, from Node.js for web scraping, testing, and automation.

1.6K
Archived
JavaScript
Backend Frameworks
CLI Tools
Node.js
#web-scraping #testing #automation

headzoo/surf

A Go library for stateful programmatic web browsing, useful for automation and scraping.

1.5K
Archived
Go
Backend Frameworks
API Frameworks
#web-scraping #automation #http-client