diff --git a/README.md b/README.md index d0e4e2d..cead9b0 100644 --- a/README.md +++ b/README.md @@ -4,6 +4,7 @@ Feel free to contribute! ## Download utilities ### Web Archiving +* [ArchiveBox](https://github.com/pirate/ArchiveBox): The open source self-hosted web archive. Takes browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more... * [Collect](https://github.com/xarantolus/Collect): A server to collect & archive websites that also supports video downloads * [grab-site](https://github.com/ludios/grab-site): The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns * [Heritrix](https://github.com/internetarchive/heritrix3): Extensible, web-scale, archival-quality web crawler