Explore Projects

Discover 162 open source projects

Active filters (1):
Search: scraper×
Clear all

Showing 21-40 of 162 projects

guyueyingmu/avbook

An adult video management system with a web crawler, database, and magnet link library for Japanese adult videos.

9.9K
Archived
PHP
API Frameworks
Laravel
#adult-video#crawler#database

FriendsOfPHP/Goutte

Goutte is a simple PHP web scraper library that allows developers to automate web interactions.

9.2K
Archived
PHP
API Clients & Testing
#web-scraper#automation#parsing

hardikvasa/google-images-download

A Python script to download hundreds of images from Google Images.

8.7K
Archived
Python
React
#image-download#google-images#python-script

apify/crawlee-python

Crawlee is a powerful web scraping and browser automation library for Python to build reliable crawlers.

8.2K
Active
Python
API Clients & Testing
Backend Frameworks
Playwright
#web-scraping#crawling#automation

TeamWiseFlow/wiseflow

A Python-based platform that uses LLMs to track and extract websites, RSS feeds, and social media for developers.

8.1K
Active
Python
LLM Frameworks
Backend Frameworks
Python
#crawler#information-gathering#information-tracker

BruceDone/awesome-crawler

A comprehensive collection of web crawlers and scrapers in various programming languages.

7.1K
Archived
Backend Frameworks
CLI Tools
#web-crawler#web-scraper#scraper

alirezamika/autoscraper

A powerful, lightweight web scraping library for Python that can automate data extraction from websites.

7.1K
Experimental
Python
Backend & APIs
CLI Tools
Python
#web-scraping#automation#data-extraction

TonyChen56/WeChatRobot

A C++ library that provides hooks and APIs for building WeChat robots and scrapers.

7.1K
Stable
C++
API Frameworks
Realtime
#wechat#wechatapi#wechatrobot

go-rod/rod

A Go library for automating and scraping websites using the Chrome DevTools Protocol.

6.8K
Stable
Go
Backend Frameworks
Testing
#automation#web-scraping#chrome-devtools

dilame/instagram-private-api

A TypeScript SDK for interacting with the Instagram private API, enabling web and mobile app development.

6.4K
Archived
TypeScript
API Clients & Testing
Backend Frameworks
TypeScript
#instagram#api-client#web-scraping

subzeroid/instagrapi

A fast and powerful Python library for interacting with the Instagram Private API, including features like automation and scraping.

5.9K
Active
Python
API Clients & Testing
Backend Frameworks
Python
#instagram#api-wrapper#automation

MontFerret/ferret

Declarative web scraping library written in Go, providing a powerful DSL for extracting data from websites.

5.9K
Stable
Go
Backend Frameworks
CLI Tools
#web-scraping#crawler#data-mining

matthewmueller/x-ray

A versatile and powerful web scraping library for JavaScript, designed to help developers extract data from the web with ease.

5.9K
Active
JavaScript
Frontend Frameworks
API Frameworks
Node.js
#web-scraping#data-extraction#crawling

yujiosaka/headless-chrome-crawler

A powerful, distributed web crawler powered by Headless Chrome for scraping websites at scale.

5.7K
Archived
JavaScript
Backend Frameworks
CLI Tools
Node.js
#web-crawler#headless-chrome#scraper

JustAnotherArchivist/snscrape

A Python library for scraping data from various social media platforms.

5.3K
Archived
Python
API Clients & Testing
Backend Frameworks
Python
#social-media-scraping#data-extraction#api-client

tiagozip/cap

A self-hosted CAPTCHA solution for modern web applications

5.0K
Active
JavaScript
JavaScript
#captcha#proof-of-work#anti-bot

drawrowfly/tiktok-scraper

TikTok Scraper in TypeScript for downloading video posts and collecting metadata

5.0K
Archived
TypeScript
React
#authentication#streaming#real-time

niespodd/browser-fingerprinting

Analysis of bot protection systems and techniques to bypass browser fingerprinting for web scraping.

5.0K
Archived
JavaScript
Security Research
Authentication
Node.js
#bot-detection#browser-fingerprinting#web-scraping

jaypyles/Scraperr

Scraperr is a self-hosted web scraper built with TypeScript, Docker, and Kubernetes for efficient, scalable data extraction.

4.9K
Stable
TypeScript
API Frameworks
CLI Tools
TypeScript
#web-scraping#self-hosted#kubernetes

Yuukiy/JavSP

A Python-based scraper that aggregates metadata from multiple sites for Japanese adult videos (JAV).

4.8K
Experimental
Python
API Frameworks
Databases
#jav#metadata#web-scraping
13...9

Stay in the loop

Get weekly updates on trending AI coding tools and projects.