Explore Projects

Discover 33 open source projects

Search: scrapy

Showing 1-20 of 33 projects

scrapy/scrapy

Scrapy is a fast, high-level web crawling and scraping framework for Python developers.

60.6K
Active
Python
Testing
Python
#web-scraping #crawling #python

crawlab-team/crawlab

A distributed web crawler admin platform for managing spiders in any language or framework.

12.2K
Stable
Go
Backend Frameworks
Go
#crawling #spider-management #distributed

scrapinghub/portia

Portia is a visual scraping tool for Scrapy, a popular Python web scraping framework.

9.5K
Archived
Python
Backend Frameworks
Python
#web-scraping #data-extraction #data-pipeline

lining0806/PythonSpiderNotes

A collection of Python web scraping notes, with code examples and techniques for various scraping tasks.

7.4K
Archived
Python
Backend Frameworks
CLI Tools
Python
#web-scraping #python #tutorials

rmax/scrapy-redis

A Redis-based distributed scraping library for the Scrapy web crawling framework.

5.6K
Archived
Python
API Frameworks
Caching
Scrapy
#crawler #distributed #redis

SpiderClub/haipproxy

A high-availability distributed IP proxy pool powered by Scrapy and Redis for web crawling applications.

5.6K
Archived
Python
API Frameworks
Containerization
Scrapy
#crawler #distributed #high-availability

DropsDevopsOrg/ECommerceCrawlers

A collection of Python-based web crawlers for scraping data from various e-commerce and online platforms.

5.4K
Archived
Python
Backend Frameworks
ETL & Pipelines
Scrapy
#web-scraping #data-extraction #e-commerce

Boris-code/feapder

A powerful Python-based web scraping framework with features like browser rendering and data deduplication.

3.6K
Stable
Python
Backend Frameworks
CLI Tools
Python
#crawler #scraper #web-scraping

Gerapy/Gerapy

A distributed crawler management framework based on Scrapy, Scrapyd, Django, and Vue.js for web scraping.

3.5K
Archived
Python
API Frameworks
Backend Frameworks
Django
#scrapy #scrapyd #django

my8100/scrapydweb

A web app for Scrapyd cluster management, Scrapy log analysis and visualization, auto packaging, timer tasks, and monitoring and alerting.

3.4K
Experimental
Python
React
#Scrapyd #Scrapy #AI-powered

scrapy-plugins/scrapy-splash

A Scrapy plugin that integrates Splash, a headless web browser, for JavaScript rendering and scraping.

3.2K
Experimental
Python
Backend Frameworks
CLI Tools
Scrapy
#headless-browsers #scraping #javascript-rendering

scrapy/scrapyd

A service daemon for running spiders built with Scrapy, the Python web scraping framework.

3.1K
Active
Python
API Frameworks
CLI Tools
Python
#web-scraping #crawling #automation

DormyMo/SpiderKeeper

An open-source admin UI for the Scrapy web scraping framework, providing a dashboard for managing and monitoring spiders.

2.8K
Archived
Python
API Frameworks
CLI Tools
Django
#web-scraping #scrapy #dashboard

QianyanTech/Image-Downloader

A Python library for downloading images from Google, Bing, and Baidu.

2.3K
Archived
Python
React
#image-downloader #google-images #pyqt

librauee/Reptile

A comprehensive Python web scraping library covering a wide range of popular websites and platforms.

1.7K
Archived
Python
Backend Frameworks
CLI Tools
#web-scraping #python3 #requests

TheWebScrapingClub/webscraping-from-0-to-hero

A comprehensive resource for learning web scraping with Python, covering tools like Playwright, Scrapy, and Splash.

1.7K
Archived
Backend Frameworks
CLI Tools
Python
#web-scraping #python #playwright

aivarsk/scrapy-proxies

A random proxy middleware for the Scrapy web scraping framework in Python.

1.7K
Archived
Python
Backend & APIs
CLI Tools
Scrapy
#web-scraping #proxies #middleware

scrapy/dirbot

A deprecated sample Scrapy project for scraping public web directories, intended for educational use.

1.6K
Archived
Python
Backend Frameworks
CLI Tools
#web-scraping #educational #public-directories

scrapy-plugins/scrapy-playwright

Scrapy-Playwright is a Python library that integrates the Playwright browser automation tool with the Scrapy web scraping framework.

1.4K
Active
Python
Backend Frameworks
CLI Tools
Python
#chrome-headless #firefox-headless #headless-browser

eliasdabbas/advertools

A Python library for online marketing and web analytics tools, including SEO, advertising, and social media.

1.4K
Stable
Python
Backend Frameworks
Search
Python
#advertising #digital-marketing #seo