Showing 1-5 of 5 projects
Self-hosted web archiving tool for preserving URLs, bookmarks, and media.
Heritrix is an open-source, extensible web crawler for archiving websites at scale.
A web crawler tool that outputs WARC files and provides a dashboard for managing crawls.
Collects and revisits web pages using Python.
A high-fidelity web archiving extension for Chrome and Chromium-based browsers, built with TypeScript.
Get weekly updates on trending AI coding tools and projects.