Explore Projects

Discover 170 open source projects

Active filters (1):
Search: crawlers×
Clear all

Showing 101-120 of 170 projects

JSREI/ast-hook-for-js-RE

A tool for exploring browser memory and implementing a crawler solution in JavaScript.

1.9K
Archived
JavaScript
Backend Frameworks
CLI Tools
Node
#crawler#browser-memory#reverse-engineering

lixi5338619/lxSpider

A Python crawler for various AI-powered platforms and websites

1.9K
Archived
Python
Python
#crawler#AI-powered#web-scraping

extractus/article-extractor

A Node.js library for extracting the main article content from a given URL using the Readability algorithm.

1.9K
Stable
JavaScript
API Frameworks
Backend Frameworks
Node
#article-extraction#web-scraping#readability

xianhu/PSpider

A simple and easy-to-use Python web scraping framework with support for multi-threading and proxies.

1.8K
Archived
Python
Backend & APIs
CLI Tools
Python
#crawler#web-scraper#multi-threading

hu17889/go_spider

A flexible and modular Go-based web crawler framework with a concurrent architecture.

1.8K
Archived
Go
API Frameworks
CLI Tools
#crawler#concurrent#pipeline

watercrawl/WaterCrawl

A versatile TypeScript-based tool that transforms web content into LLM-ready data for AI/ML applications.

1.8K
Active
TypeScript
LLM Wrappers & SDKs
Frontend Frameworks
TypeScript
#aicrawler#crawl4ai#crawler

u3c3/BT-btt

This repository appears to be related to a torrent site and does not seem to be focused on AI coding tools for developers.

1.8K
Archived
Uncategorized
#torrent#crawler#download

TeamNewPipe/NewPipeExtractor

A Java library for extracting data from various streaming platforms like YouTube, SoundCloud, and Bandcamp.

1.8K
Active
Java
API Frameworks
Backend Frameworks
#crawler#extractor#scraper

coder-hxl/x-crawl

Flexible and AI-assisted Node.js crawler library for building web scrapers and crawlers.

1.8K
Active
TypeScript
LLM Frameworks
API Frameworks
Node.js
#ai-crawl#chromium#crawler

diskoverdata/diskover-community

Diskover is an open-source file indexer, search engine, and analytics tool powered by Elasticsearch.

1.8K
Active
PHP
API Frameworks
Search
#crawler#file-indexing#file-search

howie6879/ruia

An async Python 3.6+ web scraping micro-framework based on asyncio for building high-performance crawlers and spiders.

1.7K
Archived
Python
Backend Frameworks
CLI Tools
Python
#web-scraping#asynchronous#asyncio

DanMcInerney/xsscrapy

An open-source web crawler and spider tool for detecting cross-site scripting (XSS) vulnerabilities.

1.7K
Archived
Python
Security Research
CLI Tools
Python
#web-crawler#xss-detection#penetration-testing

MarginaliaSearch/MarginaliaSearch

An internet search engine focused on indexing the small, old, and weird parts of the web.

1.7K
Active
HTML
Backend Frameworks
Search
Java
#search-engine#web-crawler#small-web

thecodrr/fdir

Fast directory crawler and globbing library for NodeJS

1.7K
Stable
TypeScript
React
#directory-crawler#globbing#fast-nodejs

delaford/game

A JavaScript 2D Medieval RPG with multiplayer capabilities

1.7K
Archived
JavaScript
Animation & Motion
React
#medieval-rpg#multiplayer-game#javascript-game

YoongiKim/AutoCrawler

A powerful Google and Naver web crawler built with Python, Selenium, and multiprocessing for efficient large-scale data collection.

1.7K
Archived
Python
Backend & APIs
Data Pipelines
Selenium
#web-crawler#multiprocessing#data-extraction

Python3Spiders/WeiboSuperSpider

A powerful Python-based web crawler and toolkit for scraping Weibo data, including user profiles, comments, images, and more.

1.7K
Archived
Python
Backend Frameworks
Databases
Python
#weibo#web-scraping#data-extraction

gigablast/open-source-search-engine

An open-source distributed search engine and web crawler written in C/C++ for Linux.

1.6K
Archived
C++
API Frameworks
Search
#search-engine#crawler#open-source

srx-2000/spider_collection

A collection of Python web scraping scripts for various websites and platforms, including music, video, and real estate data.

1.6K
Archived
Python
Backend Frameworks
ETL & Pipelines
#web-scraping#data-extraction#python-scripts

rushter/selectolax

A fast HTML5 parser with CSS selectors for Python, useful for web scraping and crawling tasks.

1.6K
Active
Cython
Frontend Frameworks
API Frameworks
Python
#web-scraping#html-parsing#css-selectors
1...57...9

Stay in the loop

Get weekly updates on trending AI coding tools and projects.