Bookmarked archived links
-
Updated
Jun 13, 2024
Bookmarked archived links
submit urls.txt to web archive using GitHub Action
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Parser for WARC (aka WebArchive) files
Quick Cache and Archive search buttons
Seeder - Czech webarchive curating tool and public site
A robust web archive analytics toolkit
Greasemonkey script that redirects from a 404 page to the Wayback Machine.
A continuation of legacy XUL version of DownThemAll! ✔️preserves web.archive.org timestamps, ✔️advanced filters for remote directory tree mirroring, ✔️UI is tweaked for better UX
link archive for year 2023
This command line converts .html file to Safari's .webarchive file.
WebBEAT website data extractor
Navigator for Web Archive
Aplikace slouží jako automatizované řešení pro identifikaci a popis mrtvých webů. Následně je ukládá do vlastní databáze a zpřístupňuje kurátorům, kteří s informacemi v ní dále nakládají, interpretují je a obsah klasifikují.
Shepherding our web archives from crawl to access.
Download and archive RSS feeds to Wayback Machine. Save a list of archived feed in locad db.
Parse a Heritrix crawl.log into an XML sitemap
Catalogization tool for the czech webarchive.
Simple python OSINT tool for urls recon thanks to the waybackmachine.
Add a description, image, and links to the webarchive topic page so that developers can more easily learn about it.
To associate your repository with the webarchive topic, visit your repo's landing page and select "manage topics."