Explore Projects

Discover 163 open source projects

Active filters (1):
Search: scrapersร—
Clear all

Showing 81-100 of 163 projects

lucasjinreal/weibo_terminater

A powerful Python-based web scraper for extracting data from Weibo, a popular Chinese social media platform.

2.3K
Archived
Python
Backend & APIs
Scraping & ETL
Python
#web-scraper#social-media-data#chinese-corpus

spider-rs/spider

A powerful web scraping and crawling library for Rust developers

2.3K
Active
Rust
API Frameworks
CLI Tools
#automation#crawler#headless-chrome

vladkens/twscrape

Scrapes Twitter API search results and user profiles with authorization support.

2.3K
Experimental
Python
React
#authentication#scraping#authorization

minimaxir/facebook-page-post-scraper

A Python scraper for extracting data from Facebook Page posts for statistical analysis.

2.1K
Archived
Python
API Clients & Testing
Backend Frameworks
Python
#facebook#scraper#data-analysis

AhmadIbrahiim/Website-downloader

A Node.js library that allows developers to download the complete source code and assets of any website for offline use.

2.1K
Stable
HTML
Backend Frameworks
API Frameworks
Node.js
#web-scraper#offline-web-pages#asset-downloader

goclone-dev/goclone

A powerful Go-based website cloning tool that can clone websites to your computer within seconds.

2.0K
Active
Go
Backend Frameworks
CLI Tools
Go
#website-cloning#website-scraping#go-lang

apify/fingerprint-suite

Browser fingerprinting tools for anonymizing scrapers

2.0K
Active
TypeScript
Playwright
#fingerprinting#anonymizing#scrapers

zerodytrash/TikTok-Live-Connector

A Node.js library to receive real-time events from TikTok LIVE streams, including comments and gifts.

1.9K
Stable
TypeScript
API Clients & Testing
Realtime
Node.js
#tiktok#live-stream#realtime

trevorhobenshield/twitter-api-client

Python library for interacting with Twitter's APIs, including v1, v2, and GraphQL.

1.9K
Archived
Python
API Clients & Testing
API Frameworks
#twitter#api#scrape

extractus/article-extractor

A Node.js library for extracting the main article content from a given URL using the Readability algorithm.

1.9K
Stable
JavaScript
API Frameworks
Backend Frameworks
Node
#article-extraction#web-scraping#readability

GodsScion/Auto_job_applier_linkedIn

Automate the LinkedIn job application process with this Python script that applies to jobs for you.

1.9K
Active
Python
API Frameworks
Backend Frameworks
Python
#auto-apply#linkedin#job-search

watercrawl/WaterCrawl

A versatile TypeScript-based tool that transforms web content into LLM-ready data for AI/ML applications.

1.8K
Active
TypeScript
LLM Wrappers & SDKs
Frontend Frameworks
TypeScript
#aicrawler#crawl4ai#crawler

TeamNewPipe/NewPipeExtractor

A Java library for extracting data from various streaming platforms like YouTube, SoundCloud, and Bandcamp.

1.8K
Active
Java
API Frameworks
Backend Frameworks
#crawler#extractor#scraper

coder-hxl/x-crawl

Flexible and AI-assisted Node.js crawler library for building web scrapers and crawlers.

1.8K
Active
TypeScript
LLM Frameworks
API Frameworks
Node.js
#ai-crawl#chromium#crawler

yhangf/PythonCrawler

A collection of Python web crawling projects for developers interested in building web scrapers and spiders.

1.8K
Experimental
Python
Backend Frameworks
CLI Tools
Python
#web-scraping#web-crawling#python3

alex/nyt-2020-election-scraper

This is a tool for scraping election data from the New York Times website.

1.8K
Archived
HTML
Frontend Frameworks
API Frameworks
HTML
#web-scraping#election-data#new-york-times

metafates/mangal

A simple and powerful CLI tool for downloading manga from various sources, including MangaDex integration.

1.7K
Experimental
Go
API Frameworks
CLI Tools
Go
#manga#downloader#anilist

website-scraper/node-website-scraper

A Node.js library for scraping websites and downloading their contents locally.

1.7K
Active
JavaScript
Backend Frameworks
CLI Tools
Node.js
#scraper#website-downloader#node.js

claffin/cloudproxy

A Python library that provisions proxy servers across cloud providers to improve web scraping success.

1.7K
Active
Python
API Clients & Testing
Containerization
#web-scraping#cloud-infrastructure#proxy-server

th3unkn0n/TeleGram-Scraper

A Python tool for scraping information from Telegram groups, including group member details.

1.6K
Archived
Python
CLI Tools
Information Retrieval
#telegram#scraper#information-gathering
1...46...9

Stay in the loop

Get weekly updates on trending AI coding tools and projects.