Explore Projects

Discover 162 open source projects

Active filters (1):

Search: scraper

Showing 141-160 of 162 projects

cnbattle/douyin

A Go-based web scraper for video content on Douyin (the Chinese counterpart of TikTok) that runs on virtual machines or real devices.

1.2K

Stable

API Frameworks

Backend Frameworks

#web-scraper #tiktok #douyin

gildas-lormeau/single-file-cli

CLI tool for saving a complete web page as a single HTML file, useful for web archiving and scraping.

1.2K

Stable

JavaScript

CLI Tools

Node.js

#web-scraping #web-archiving #cli

egbertbouman/youtube-comment-downloader

Simple script for downloading YouTube comments without using the YouTube API.

1.2K

Stable

Python

Data Scraper

CLI Tools

Python

#youtube #data-scraping #comments

Sniper970119/dianping_spider

A web scraper for the popular Chinese review platform Dianping, with solutions for dynamic font encryption and non-OCR approaches.

1.2K

Archived

Python

Backend Frameworks

Web Scraping

Python

#web-scraping #dynamic-font-encryption #non-ocr

k1995/BaiduyunSpider

A web scraper for searching and accessing files on the Baidu Cloud platform.

1.2K

Archived

JavaScript

Backend Frameworks

Node

#web-scraper #baidu-cloud #file-search

Decodo/Decodo

Residential proxy server with HTTP(S)/SOCKS5 rotation capabilities for web scraping and data collection.

1.2K

Stable

Java

API Clients & Testing

Caching

#web-scraping #data-collection #http-proxy

TheBeastLT/torrentio-scraper

A JavaScript-based scraper for Torrentio that aggregates and indexes torrent metadata for streaming.

1.2K

Active

JavaScript

API Frameworks

Backend Frameworks

Node

#torrent #scraper #streaming

holgerd77/django-dynamic-scraper

A Django-based platform for creating Scrapy scrapers through the admin interface, useful for web scraping tasks.

1.2K

Archived

Python

API Frameworks

ORMs & Query Builders

Django

#scraper #scraping #crawler

KEV0143/Parser-Chitai-Gorod

A high-speed, intelligent web scraper for the Chitai-Gorod book catalog, enabling structured data collection.

1.2K

Experimental

Python

Backend & APIs

CLI Tools

Python

#web-scraping #data-extraction #book-catalog

raawaa/jav-scrapy

A TypeScript-based web scraper for fetching adult video magnet links and cover images in bulk.

1.2K

Stable

TypeScript

Backend Frameworks

Web Scraping

TypeScript

#web-scraping #magnet-links #adult-content

shadowmoose/RedditDownloader

A Python-based tool to scrape Reddit and download media content from the platform.

1.2K

Archived

Python

API Frameworks

Backend Frameworks

Python

#reddit #scraper #media-downloader

eracle/OpenOutreach

A Python-based LinkedIn automation tool for visiting profiles, connecting, and messaging using AI.

1.1K

Active

Python

LLM Wrappers & SDKs

Backend Frameworks

Playwright

#linkedin-automation #outreach #marketing-automation

oxylabs/how-to-bypass-amazon-captcha

A Python library for bypassing Amazon CAPTCHAs using the Oxylabs Amazon Scraper API.

1.1K

Stable

Python

API Clients & Testing

Backend Frameworks

Python

#amazon #captcha-solving #scraper-api

juancarlospaco/faster-than-requests

A high-performance Python HTTP client library, implemented in Nim, that aims to be faster than the requests library.

1.1K

Active

Nim

Backend & APIs

CLI Tools

#high-performance #web-scraping #http-requests

vesche/scanless

An online port scan scraper written in Python for penetration testing and security research.

1.1K

Archived

Python

Penetration Testing

CLI Tools

#port-scanner #scraper #security-research

tholian-network/stealth

Stealth is a secure, peer-to-peer, private, and automatable web browser/scraper/proxy for developers who value privacy.

1.1K

Archived

JavaScript

Privacy Tools

Backend Frameworks

Node.js

#anonymity #privacy-protection #web-browser

oxylabs/how-to-scrape-google-jobs

A Python library to scrape Google Jobs listings for multiple search queries and locations.

1.1K

Stable

Python

API Clients & Testing

Backend Frameworks

Python

#google-scraping #google-search-api #serp-scraping

vifreefly/kimuraframework

A modern Ruby web scraping framework that can interact with JavaScript-rendered websites using headless browsers.

1.1K

Active

Ruby

Backend Frameworks

CLI Tools

#web-scraping #headless-browser #ruby

yjl9903/AnimeGarden

An open-source platform for aggregating and accessing anime/animation torrent resources through a web interface and API.

1.1K

Active

TypeScript

API Frameworks

Frontend Frameworks

React

#anime #torrent #scraper

elixir-crawly/crawly

Crawly is a high-level web crawling and scraping framework for Elixir, enabling developers to extract data from websites efficiently.

1.1K

Experimental

Elixir

Backend Frameworks

Caching

#crawler #crawling #scraper
