Explore Projects

Discover 170 open source projects

Active filters (1):

Search: crawlers×

Clear all

Showing 81-100 of 170 projects

oxylabs/oxylabs-ai-studio-py

AI-powered web scraping and data gathering SDK for building intelligent agents and LLM apps

2.5K

Stable

Python

LLM Frameworks

AI SDKs & Wrappers

Python

#ai-crawler#ai-scraper#web-scraping

xtuhcy/gecco

A lightweight web crawler built with Java for easy use

2.5K

Active

Java

#crawler#lightweight#web-crawler

lorien/grab

A powerful web scraping framework for Python that supports asynchronous crawling and flexible data extraction.

2.5K

Stable

Python

Backend Frameworks

CLI Tools

Python

#web-scraping#crawling#asynchronous

dw-dengwei/daily-arXiv-ai-enhanced

Daily arXiv paper crawler with AI summaries & GitHub Pages visualization for research discovery.

2.4K

Active

JavaScript

LLM Wrappers & SDKs

Resource Collections

GitHub Pages

#arxiv-crawler#ai-summarization#research-papers

friuns2/Leaked-GPTs

A collection of leaked GPT prompts and tools to bypass subscription limits and try out AI models.

2.4K

Archived

Python

LLM Frameworks

Python

#ai#chatgpt#gpt

Python3WebSpider/Python3WebSpider

A Python library for building web crawlers and spiders, suitable for vibe coders interested in web automation.

2.4K

Archived

Backend Frameworks

CLI Tools

#web-scraping#automation#python

fhamborg/news-please

news-please is an integrated web crawler and information extractor for news that works out of the box.

2.4K

Stable

Python

API Frameworks

Web Crawlers

#news#web-crawler#data-extraction

decaywood/XueQiuSuperSpider

A Java-based stock information crawler for the XueQiu platform

2.3K

Archived

Java

None

React

#stock-crawler#XueQiu#AI-powered

JayBizzle/Crawler-Detect

A PHP library for detecting bots, crawlers, and spiders based on the user agent string.

2.3K

Active

PHP

API Frameworks

CLI Tools

PHP

#bots#crawler#user-agent

lucasjinreal/weibo_terminater

A powerful Python-based web scraper for extracting data from Weibo, a popular Chinese social media platform.

2.3K

Archived

Python

Backend & APIs

Scraping & ETL

Python

#web-scraper#social-media-data#chinese-corpus

sjdirect/abot

A cross-platform, fast, and flexible C# web crawler framework for developers building crawlers and spiders.

2.3K

Archived

Backend Frameworks

#web-crawler#cross-platform#c-sharp

spider-rs/spider

A powerful web scraping and crawling library for Rust developers

2.3K

Active

Rust

API Frameworks

CLI Tools

#automation#crawler#headless-chrome

simplecrawler/simplecrawler

Flexible event-driven web crawler for Node.js, useful for building custom web scraping solutions.

2.1K

Archived

JavaScript

Backend Frameworks

CLI Tools

Node.js

#web-scraping#crawling#http-client

goclone-dev/goclone

A powerful Go-based website cloning tool that can clone websites to your computer within seconds.

2.0K

Active

Backend Frameworks

CLI Tools

#website-cloning#website-scraping#go-lang

zhegexiaohuozi/SeimiCrawler

A distributed, agile Java-based web crawler framework that can be used in SpringBoot applications.

2.0K

Archived

Java

API Frameworks

CLI Tools

#web-crawler#distributed-crawling#spring-boot

rendora/rendora

Rendora is a dynamic server-side rendering solution using headless Chrome for SEO optimization.

2.0K

Archived

Frontend Frameworks

API Frameworks

Angular

#dynamic-rendering#ssr#seo-optimization

BlankerL/DXY-COVID-19-Crawler

Real-time COVID-19 infection data crawler and API for developers tracking the pandemic.

2.0K

Experimental

Python

API Frameworks

Realtime

Python

#covid-19#crawler#realtime-api

Nekmo/dirhunt

A Python tool to find web directories without bruteforcing, useful for security researchers and penetration testers.

2.0K

Archived

Python

Penetration Testing

CLI Tools

Python

#crawler#dirscanner#security-tools

AAndyProgram/SCrawler

A media downloader for various social and adult platforms, including Twitter, Reddit, and OnlyFans.

1.9K

Active

Visual Basic .NET

General Utilities

Realtime

Visual Basic .NET

#crawler#download#media

Ehco1996/Python-crawler

This repository provides a systematic guide for learning how to build Python web crawlers.

1.9K

Archived

HTML

Backend Frameworks

CLI Tools

#web-scraping#python-crawler#web-automation

1...46...9

Stay in the loop

Get weekly updates on trending AI coding tools and projects.