Explore Projects

Discover 255 open source projects

Active filters (1):
Search: scraping

Showing 81-100 of 255 projects

postaddictme/instagram-php-scraper

A PHP library to scrape data from Instagram, including accounts, photos, videos, stories, and comments.

3.3K
Experimental
PHP
API Clients & Testing
Backend Frameworks
#instagram #scraping #api-client

opsdisk/pagodo

A Python tool to automate Google Hacking Database scraping and searching for bug bounty and OSINT purposes.

3.3K
Stable
Python
Security Research
CLI Tools
#bugbounty #google-dork #google-hacking-database

scrapy-plugins/scrapy-splash

A Scrapy plugin that integrates Splash, a headless web browser, for JavaScript rendering and scraping.

3.2K
Experimental
Python
Backend Frameworks
CLI Tools
Scrapy
#headless-browsers #scraping #javascript-rendering

scrapy/scrapyd

A service daemon for running spiders built with Scrapy, a Python web scraping framework.

3.1K
Active
Python
API Frameworks
CLI Tools
Python
#web-scraping #crawling #automation

cporter202/scraping-apis-for-devs

This repository provides a collection of scraping APIs for developers to build automations and applications.

3.0K
Active
JavaScript
API Clients & Testing
Backend Frameworks
Node
#api #scraping #automation

x0rz/tweets_analyzer

A Python-based tool for scraping and analyzing Twitter user and tweet metadata.

3.0K
Archived
Python
Backend Frameworks
Databases
Python
#twitter #data-analysis #scraping

x4nth055/pythoncode-tutorials

A collection of Python tutorials covering a wide range of topics from computer vision to network security.

3.0K
Stable
Jupyter Notebook
Tutorials & Courses
ETL & Pipelines
#python #tutorials #machine-learning

oxylabs/google-ai-mode-scraper

Scrape Google AI Mode responses at scale without getting blocked, using Java.

3.0K
Stable
Java
React
#authentication #streaming #real-time

TheBlewish/Automated-AI-Web-Researcher-Ollama

A Python program that turns an LLM into an automated web researcher, scraping content and saving findings.

3.0K
Archived
Python
LLM Frameworks
API Frameworks
Python
#llm #web-scraping #research-automation

itsOwen/CyberScraper-2077

Powerful web scraper using LLM and AI

2.9K
Active
Python
AI-powered web scraping tools
OpenAI
#ai-scraping #llm-scraper #web-scraper

ptwobrussell/Mining-the-Social-Web-2nd-Edition

An official compendium for the book 'Mining the Social Web' focused on web scraping and data analysis.

2.9K
Archived
HTML
Backend Frameworks
ETL & Pipelines
#web-scraping #data-analysis #book-companion

5ime/video_spider

A PHP library for scraping and removing watermarks from various video platforms.

2.9K
Stable
PHP
API Frameworks
Web Scrapers
#video-scraping #watermark-removal #video-platforms

topfunky/hpple

An Objective-C XML/HTML parsing library inspired by Hpricot, useful for web scraping and data extraction.

2.9K
Archived
Objective-C
Backend Frameworks
General Utilities
#html-parsing #xml-parsing #web-scraping

GhostenEditor/Ghosten-Player

A video player that supports direct connection to network disk, metadata scraping, IPTV, and file management.

2.8K
Active
Dart
API Frameworks
Component Libraries (Flutter)
Flutter
#video-player #iptv #aliyundrive

NikolaiT/GoogleScraper

A Python module for scraping several search engines, with asynchronous networking support.

2.8K
Archived
HTML
Python
#search-engine-scraping #asynchronous-networking #python-module

DormyMo/SpiderKeeper

An open-source admin UI for the Scrapy web scraping framework, providing a dashboard for managing and monitoring spiders.

2.8K
Archived
Python
API Frameworks
CLI Tools
Django
#web-scraping #scrapy #dashboard

geziyor/geziyor

Geziyor is a fast web crawling and scraping framework for Go that supports JavaScript rendering.

2.8K
Experimental
Go
API Frameworks
CLI Tools
#crawler #scraper #web-scraping

jeanphix/Ghost.py

Ghost.py is a WebKit-based scriptable web browser for Python, allowing web automation and scraping.

2.8K
Archived
Python
Backend Frameworks
CLI Tools
Python
#web-automation #web-scraping #browser-automation

facundoolano/google-play-scraper

A Node.js library for scraping data from the Google Play Store.

2.8K
Stable
JavaScript
API Clients & Testing
Backend Frameworks
Node.js
#google-play #scraper #data-extraction

any4ai/AnyCrawl

AnyCrawl is a Node.js/TypeScript web scraper that extracts structured data from search engines and websites for use in AI/LLM applications.

2.8K
Active
TypeScript
LLM Wrappers & SDKs
Backend Frameworks
Node.js
#web-scraper #serp #data-extraction