Explore Projects

Discover 162 open source projects

Active filters (1):
Search: scraper

Showing 141-160 of 162 projects

cnbattle/douyin

A Go-based web scraper for video content on Douyin (the Chinese counterpart of TikTok) that runs on virtual machines or real devices.

1.2K
Stable
Go
API Frameworks
Backend Frameworks
Go
#web-scraper #tiktok #douyin

gildas-lormeau/single-file-cli

CLI tool for saving a complete web page as a single HTML file, useful for web archiving and scraping.

1.2K
Stable
JavaScript
CLI Tools
Node.js
#web-scraping #web-archiving #cli

egbertbouman/youtube-comment-downloader

Simple script for downloading YouTube comments without using the YouTube API.

1.2K
Stable
Python
Data Scraper
CLI Tools
Python
#youtube #data-scraping #comments

Sniper970119/dianping_spider

A web scraper for the popular Chinese review platform Dianping, with solutions for dynamic font encryption and non-OCR approaches.

1.2K
Archived
Python
Backend Frameworks
Web Scraping
Python
#web-scraping #dynamic-font-encryption #non-ocr

k1995/BaiduyunSpider

A web scraper for searching and accessing files on the Baidu Cloud platform.

1.2K
Archived
JavaScript
Backend Frameworks
Search
Node.js
#web-scraper #baidu-cloud #file-search

Decodo/Decodo

Residential proxy server with HTTP(S)/SOCKS5 rotation capabilities for web scraping and data collection.

1.2K
Stable
Java
API Clients & Testing
Caching
#web-scraping #data-collection #http-proxy

TheBeastLT/torrentio-scraper

A JavaScript-based scraper for the Torrentio streaming add-on that aggregates and indexes torrent metadata.

1.2K
Active
JavaScript
API Frameworks
Backend Frameworks
Node.js
#torrent #scraper #streaming

holgerd77/django-dynamic-scraper

A Django-based platform for creating and managing Scrapy scrapers through the Django admin interface.

1.2K
Archived
Python
API Frameworks
ORMs & Query Builders
Django
#scraper #scraping #crawler

KEV0143/Parser-Chitai-Gorod

A high-speed, intelligent web scraper for the Chitai-Gorod book catalog, enabling structured data collection.

1.2K
Experimental
Python
Backend & APIs
CLI Tools
Python
#web-scraping #data-extraction #book-catalog

raawaa/jav-scrapy

A TypeScript-based web scraper for fetching adult video magnet links and cover images in bulk.

1.2K
Stable
TypeScript
Backend Frameworks
Web Scraping
TypeScript
#web-scraping #magnet-links #adult-content

shadowmoose/RedditDownloader

A Python-based tool that scrapes Reddit and downloads media content from it.

1.2K
Archived
Python
API Frameworks
Backend Frameworks
Python
#reddit #scraper #media-downloader

eracle/OpenOutreach

A Python-based LinkedIn automation tool that uses AI to visit profiles, send connection requests, and message contacts.

1.1K
Active
Python
LLM Wrappers & SDKs
Backend Frameworks
Playwright
#linkedin-automation #outreach #marketing-automation

oxylabs/how-to-bypass-amazon-captcha

A Python how-to guide for bypassing Amazon CAPTCHAs using the Oxylabs Amazon Scraper API.

1.1K
Stable
Python
API Clients & Testing
Backend Frameworks
Python
#amazon #captcha-solving #scraper-api

juancarlospaco/faster-than-requests

A high-performance HTTP client library for Python, implemented in Nim, that aims to be faster than the popular third-party requests library.

1.1K
Active
Nim
Backend & APIs
CLI Tools
#high-performance #web-scraping #http-requests

vesche/scanless

A Python tool that scrapes online port-scanning services to run scans on your behalf, for penetration testing and security research.

1.1K
Archived
Python
Penetration Testing
CLI Tools
#port-scanner #scraper #security-research

tholian-network/stealth

Stealth is a secure, peer-to-peer, private, and automatable web browser/scraper/proxy for developers who value privacy.

1.1K
Archived
JavaScript
Privacy Tools
Backend Frameworks
Node.js
#anonymity #privacy-protection #web-browser

oxylabs/how-to-scrape-google-jobs

A Python how-to guide for scraping Google Jobs listings across multiple search queries and locations.

1.1K
Stable
Python
API Clients & Testing
Backend Frameworks
Python
#google-scraping #google-search-api #serp-scraping

vifreefly/kimuraframework

A modern Ruby web scraping framework that can interact with JavaScript-rendered websites using headless browsers.

1.1K
Active
Ruby
Backend Frameworks
CLI Tools
#web-scraping #headless-browser #ruby

yjl9903/AnimeGarden

An open-source platform for aggregating and accessing anime torrent resources through a web interface and API.

1.1K
Active
TypeScript
API Frameworks
Frontend Frameworks
React
#anime #torrent #scraper

elixir-crawly/crawly

Crawly is a high-level web crawling and scraping framework for Elixir that lets developers extract structured data from websites.

1.1K
Experimental
Elixir
Backend Frameworks
Caching
#crawler #crawling #scraper
