Explore Projects

Discover 255 open source projects

Active filters (1):
Search: scraping×
Clear all

Showing 241-255 of 255 projects

oxylabs/how-to-scrape-google-jobs

A Python library to scrape Google Jobs listings for multiple search queries and locations.

1.1K
Stable
Python
API Clients & Testing
Backend Frameworks
Python
#google-scraping#google-search-api#serp-scraping

vifreefly/kimuraframework

A modern Ruby web scraping framework that can interact with JavaScript-rendered websites using headless browsers.

1.1K
Active
Ruby
Backend Frameworks
CLI Tools
#web-scraping#headless-browser#ruby

factbook/factbook.json

Provides free, open-domain data on countries from the World Factbook in JSON format.

1.1K
Active
API Clients & Testing
Data Sources
#open-data#json#country-profiles

bytebuff/JSpider

JSpider is a JavaScript web scraping tool that automatically decrypts and extracts data from websites.

1.1K
Archived
JavaScript
Backend Frameworks
CLI Tools
Node.js
#web-scraping#javascript#nodejs

johnwmillr/LyricsGenius

A Python library for downloading song lyrics and metadata from Genius.com.

1.1K
Stable
Python
API Clients & Testing
Backend Frameworks
#lyrics#genius-api#song-lyrics

elixir-crawly/crawly

Crawly is a high-level web crawling and scraping framework for Elixir, enabling developers to extract data from websites efficiently.

1.1K
Experimental
Elixir
Backend Frameworks
Caching
#crawler#crawling#scraper

skygazer42/DL-Hub

A collection of notes and projects on machine learning, deep learning, computer vision, NLP, and web scraping.

1.1K
Active
Python
LLM Frameworks
Databases
Python
#machine-learning#deep-learning#computer-vision

r3nt0n/bopscrk

A Python tool to generate smart and powerful wordlists for password cracking and cybersecurity testing.

1.1K
Archived
Python
Security Research
CLI Tools
#password-cracking#wordlist-generator#cybersecurity

jonbakerfish/TweetScraper

A simple Python library for scraping tweets from Twitter without using the official API.

1.1K
Archived
Python
Backend & APIs
CLI Tools
#twitter#scraping#tweets

jaimeiniesta/metainspector

Ruby gem for web scraping that extracts titles, meta data, links, and images from a given URL.

1.0K
Active
Ruby
Backend Frameworks
API Clients & Testing
#web-scraping#data-extraction#meta-data

bellingcat/auto-archiver

A Python-based tool to automatically archive links to videos, images, and social media content from Google Sheets and other sources.

1.0K
Active
Python
API Frameworks
CLI Tools
Docker
#archive#web-scraping#google-sheets

neon-mmd/websurfx

An open-source, privacy-focused meta search engine built with Rust, offering fast and secure web browsing.

1.0K
Active
Rust
API Frameworks
Backend Frameworks
#meta-search-engine#privacy-focused#open-source

supzhang/epg

A Python-based library for scraping TV program listings (EPG) from multiple sources and generating TVXML files.

1.0K
Archived
Python
API Frameworks
APIs
#tv-listings#epg#scraping

utkarshkukreti/select.rs

A Rust library for web scraping that can extract useful data from HTML documents.

1.0K
Experimental
Rust
Backend Frameworks
ORMs & Query Builders
#web-scraping#html-parsing#data-extraction

jfilter/clean-text

A Python library for cleaning and preprocessing text data, useful for NLP tasks.

1.0K
Active
Python
API Frameworks
Data Preprocessing
Python
#natural-language-processing#text-cleaning#text-normalization
1...12

Stay in the loop

Get weekly updates on trending AI coding tools and projects.