Explore Projects

Discover 170 open source projects

Active filters (1):
Search: crawlers

Showing 121-140 of 170 projects

ArchiveTeam/grab-site

A web crawler tool that outputs WARC files and provides a dashboard for managing crawls.

1.6K
Experimental
Python
CLI Tools
Backend Frameworks
Python
#archiving#crawling#web-scraping

github/lightcrawler

A JavaScript tool that crawls a website and runs it through Google Lighthouse for performance, accessibility, and best practices analysis.

1.5K
Archived
JavaScript
Frontend Frameworks
CLI Tools
Node.js
#web-crawler#web-performance#accessibility-testing

oxylabs/how-to-scrape-google-images

Automated Google image scraper that retrieves and parses image data using HTTP requests.

1.5K
Stable
Python
API Clients & Testing
Backend Frameworks
Python
#google-image-api#image-scraping#web-scraping

danielmiessler/RobotsDisallowed

A curated list of commonly disallowed robots.txt directories, useful for web crawlers and SEO.

1.5K
Archived
Shell
CLI Tools
Backend Frameworks
#web-crawling#robots.txt#seo

Autumn-27/ScopeSentry

ScopeSentry is a comprehensive security tool for mapping cyberspace, enumerating subdomains, scanning ports, and identifying vulnerabilities.

1.5K
Active
Go
Security Research
CLI Tools
Go
#infosec#osint#vulnerability-scanning

keenwon/antcolony

A Node.js-based web scraper and crawler for finding and downloading torrent files.

1.5K
Archived
JavaScript
Backend Frameworks
API Frameworks
Node.js
#web-scraping#torrent#bittorrent

dadoonet/fscrawler

An Elasticsearch-powered file system crawler that can index content from various file formats.

1.4K
Active
Java
API Frameworks
Search
Java
#elasticsearch#crawler#file-indexing

openwpm/OpenWPM

OpenWPM is a web privacy measurement framework written in Python that can be used to crawl and analyze websites.

1.4K
Experimental
Python
Backend & APIs
CLI Tools
Python
#crawler#firefox#privacy

darbra/sperm

A collection of reverse engineering articles.

1.4K
Stable
Crawling & Scraping
Reverse Engineering
#crawl#crawler#frida

LeonardoCardoso/SwiftLinkPreview

A Swift library that generates previews from URLs, extracting titles, images, and relevant text.

1.4K
Archived
Swift
Component Libraries (Swift)
CLI Tools
Swift
#url-preview#web-crawler#swift-package-manager

lorey/mlscraper

A Python library that scrapes data from websites using machine learning and HTML examples.

1.4K
Archived
Python
Backend Frameworks
ETL & Pipelines
#web-scraping#data-extraction#machine-learning

amirgamil/apollo

A personal search engine and web crawler for developers to explore their digital footprint.

1.4K
Archived
Go
CLI Tools
API Frameworks
Go
#personal-search#web-crawler#unix-like

monperrus/crawler-user-agents

A Go library that provides a database of syntactic patterns of HTTP user-agents used by bots/crawlers/scrapers.

1.4K
Active
Go
CLI Tools
API Frameworks
#user-agent#crawler#bot

felipecsl/wombat

Lightweight Ruby web crawler/scraper with an elegant DSL to extract structured data from pages.

1.4K
Stable
Ruby
Backend Frameworks
API Frameworks
#crawler#scraper#dsl

sec-edgar/sec-edgar

Downloads periodic reports, filings, and forms from the EDGAR database.

1.4K
Stable
Python
Python
#edgar-database#finance#python

eliasdabbas/advertools

A Python library of online marketing and web analytics tools covering SEO, advertising, and social media.

1.4K
Stable
Python
Backend Frameworks
Search
Python
#advertising#digital-marketing#seo

xisuo67/XHS-Spider

A web scraping and data collection tool for the Chinese social media platform Xiaohongshu (Little Red Book).

1.4K
Stable
Backend Frameworks
API Frameworks
C#
#crawler#downloader#scraper

kgspider/crawler

A JavaScript web scraper repository focused on reverse engineering and advanced crawling techniques.

1.3K
Experimental
JavaScript
Backend Frameworks
CLI Tools
Node.js
#crawler#web-scraper#reverse-engineering

scrapinghub/frontera

A scalable crawl frontier for high-performance web crawlers.

1.3K
Experimental
Python
API Frameworks
Backend Frameworks
Python
#web-crawling#distributed-systems#high-performance

adyzng/jd-autobuy

A Python web scraper for automatically logging in and purchasing items on JD.com.

1.3K
Archived
Python
Backend & APIs
API Frameworks
Python
#crawler#scraper#jingdong
