Explore Projects

Discover 255 open source projects

Active filters (1):
Search: scrapingร—
Clear all

Showing 161-180 of 255 projects

JonasCz/How-To-Prevent-Scraping

A guide on effective techniques to prevent website scraping and protect your web application's content.

1.5K
Archived
API Documentation
#web-scraping#content-protection#anti-scraping

emcf/thepipe

A Python library that helps developers extract structured data from tricky documents using vision-language models.

1.5K
Stable
Python
LLM Frameworks
ETL & Pipelines
Python
#document-processing#large-language-models#multimodal

tidyverse/rvest

A simple web scraping library for R, allowing developers to extract data from websites.

1.5K
Stable
R
React
#web-scraping#html#r

yhat/scrape

A simple, higher-level interface for Go web scraping, suitable for various development tasks.

1.5K
Archived
Go
Backend Frameworks
API Clients & Testing
#web-scraping#http-client#automation

TheGP/untidetect-tools

A collection of tools and browsers for web scraping, bot automation, and evading detection.

1.5K
Stable
API Frameworks
CLI Tools
#anti-detection#scraping#automation

oxylabs/how-to-scrape-google-images

Automated Google image scraper that retrieves and parses image data using HTTP requests.

1.5K
Stable
Python
API Clients & Testing
Backend Frameworks
Python
#google-image-api#image-scraping#web-scraping

requests-cache/requests-cache

Requests-cache is a Python library that provides a persistent HTTP cache for the requests library, improving performance for web scraping and API calls.

1.5K
Active
Python
HTTP Clients
Caching
Python
#http#web-scraping#caching

m8sec/CrossLinked

A LinkedIn enumeration tool that extracts valid employee names from an organization through web scraping.

1.5K
Archived
Python
Penetration Testing
CLI Tools
Python
#enumeration#osint#webscraping

ulixee/hero

A web browser built for scraping, providing a powerful and extensible platform for automating web interactions.

1.5K
Active
TypeScript
Backend Frameworks
CLI Tools
TypeScript
#web-scraping#automation#browser-automation

ape-byte/DouyinBarrageGrab

A C# program to scrape Douyin (TikTok) live stream chat messages using system proxies.

1.5K
Experimental
C#
Realtime
Backend Frameworks
#livestream#scraping#douyin

oxylabs/how-to-scrape-google-flights

A Python library for scraping and analyzing flight data from Google Flights API.

1.5K
Stable
Python
API Clients & Testing
Backend Frameworks
Python
#google-flight-api#google-flights-api#scrape-google-flights

dembrandt/dembrandt

A JavaScript tool that automatically extracts a website's design system into usable tokens for building UI components.

1.5K
Active
JavaScript
Component Libraries (React)
CLI Tools
React
#automation#design-system#design-tokens

roach-php/core

A comprehensive web scraping toolkit for PHP developers, with capabilities for crawling, parsing, and extracting data from websites.

1.5K
Stable
PHP
Backend Frameworks
API Frameworks
#crawling#web-scraping#php

jamesturk/scrapeghost

An experimental Python library for scraping websites using OpenAI's GPT API.

1.4K
Active
Python
LLM Wrappers & SDKs
API Clients & Testing
Python
#openai#webscraping#gpt

tcc0lin/Review_Reverse

A repository focused on web scraping and automation using JavaScript, Puppeteer, and Python.

1.4K
Archived
JavaScript
Backend Frameworks
API Clients & Testing
JavaScript
#web-scraping#automation#puppeteer

Medium/phantomjs

A NPM wrapper for installing the PhantomJS headless browser, useful for web scraping and automated testing.

1.4K
Archived
JavaScript
Backend Frameworks
CLI Tools
React
#headless-browser#web-scraping#automated-testing

jonnnnyw/php-phantomjs

Execute PhantomJS commands through PHP, enabling server-side automation and web scraping tasks.

1.4K
Archived
PHP
Backend Frameworks
PHP
#web-scraping#automation#headless-browser

oxylabs/how-to-scrape-amazon-prices

Extracts best-selling items, search results, and deals from Amazon using Python and Oxylabs E-Commerce Scraper API.

1.4K
Stable
Python
React
#amazon-scraper#api#python-scraper

oxylabs/how-to-scrape-google-scholar

A Python library for extracting titles, authors, and citations from Google Scholar using web scraping.

1.4K
Stable
Python
Backend & APIs
CLI Tools
Python
#google-scholar#web-scraping#web-crawler

damklis/DataEngineeringProject

An end-to-end data engineering project example showcasing tools and technologies for building data pipelines.

1.4K
Archived
Python
ETL & Pipelines
API Frameworks
Django
#data-engineering#data-pipeline#etl
1...810...13

Stay in the loop

Get weekly updates on trending AI coding tools and projects.