mirror of
https://github.com/simon987/awesome-datahoarding
synced 2025-04-10 14:16:48 +00:00
Merge branch 'master' into master
This commit is contained in:
commit
31ba4f21a7
98
README.md
98
README.md
@ -4,42 +4,44 @@ Feel free to contribute!
|
||||
|
||||
### Download/Scraping utilities
|
||||
|
||||
* [Rclone](https://github.com/ncw/rclone): A command line program to sync files and directories to and from various cloud storage providers
|
||||
* [aria2](https://aria2.github.io/): A lightweight multi-protocol & multi-source command-line download utility
|
||||
* [wget](https://savannah.gnu.org/git/?group=wget): Utility for non-interactive download of files from
|
||||
the Web (HTTP & FTP)
|
||||
* [curl](https://github.com/curl/curl): Tool and library for transferring data with URL syntax, supporting many protocols
|
||||
* [Youtube-DL](https://github.com/rg3/youtube-dl): A command-line program to download videos from YouTube and a few hundred more sites
|
||||
* [annie](https://github.com/iawia002/annie): Youtube-DL alternative writtent in Golang
|
||||
* [wikiteam](https://github.com/WikiTeam/wikiteam): set of tools for archiving wikis
|
||||
* [FicSave](https://github.com/waylaidwanderer/FicSave): Online fanfiction downloader
|
||||
* [FanFicFare](https://github.com/JimmXinu/FanFicFare): Tool for making eBooks from stories on fanfiction and other web sites
|
||||
* [yt-mango](https://github.com/terorie/yt-mango): Youtube metadata archiver
|
||||
* [Youtube-MA](https://github.com/CorentinB/YouTube-MA): Youtube metadata archiver
|
||||
* [CrowLeer](https://github.com/ERap320/CrowLeer): Powerful C++ web crawler based on libcurl
|
||||
* [floatplane_ripper](https://gist.github.com/simon987/0756c378ca2dfb0003931e26ff7fe270): Script to rip all videos from https://floatplane.rip/
|
||||
* [grab-site](https://github.com/ludios/grab-site): The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
|
||||
* [dzi-dl](https://github.com/ryanfb/dzi-dl): Deep Zoom Image Downloader
|
||||
* [iiif-dl](https://github.com/ryanfb/iiif-dl): Command-line tile downloader/assembler for IIIF endpoints/manifests
|
||||
* [aria2](https://aria2.github.io/): A lightweight multi-protocol & multi-source command-line download utility
|
||||
* [bazarr](https://github.com/morpheus65535/bazarr): Companion application to Sonarr and Radarr for downloading subtitles
|
||||
* [ChanThreadWatch](https://github.com/SuperGouge/ChanThreadWatch): Saves threads from \*chan-style boards and checks for updates until the thread dies
|
||||
* [Sonarr](https://github.com/Sonarr/Sonarr): PVR for Usenet and BitTorrent users
|
||||
* [Radarr](https://github.com/Radarr/Radarr): A fork of Sonarr to work with movies à la Couchpotato
|
||||
* [Collect](https://github.com/xarantolus/Collect): A server to collect & archive websites that also supports video downloads
|
||||
* [CrowLeer](https://github.com/ERap320/CrowLeer): Powerful C++ web crawler based on libcurl
|
||||
* [curl](https://github.com/curl/curl): Tool and library for transferring data with URL syntax, supporting many protocols
|
||||
* [dzi-dl](https://github.com/ryanfb/dzi-dl): Deep Zoom Image Downloader
|
||||
* [FanFicFare](https://github.com/JimmXinu/FanFicFare): Tool for making eBooks from stories on fanfiction and other web sites
|
||||
* [FicSave](https://github.com/waylaidwanderer/FicSave): Online fanfiction downloader
|
||||
* [FlexGet](https://github.com/Flexget/Flexget): Multipurpose automation tool for content like torrents, nzbs, podcasts, comics, series, movies, etc
|
||||
* [floatplane_ripper](https://gist.github.com/simon987/0756c378ca2dfb0003931e26ff7fe270): Script to rip all videos from https://floatplane.rip/
|
||||
* [gallery-dl](https://github.com/mikf/gallery-dl): Fownload image galleries and collections from pixiv, exhentai, danbooru and more
|
||||
* [grab-site](https://github.com/ludios/grab-site): The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
|
||||
* [Heritrix](https://github.com/internetarchive/heritrix3): Extensible, web-scale, archival-quality web crawler
|
||||
* [HTTrack](https://www.httrack.com/): Download a website from the Internet to a local directory
|
||||
* [iiif-dl](https://github.com/ryanfb/iiif-dl): Command-line tile downloader/assembler for IIIF endpoints/manifests
|
||||
* [Instagram Scraper](https://github.com/dankmemes/instagram-scraper): Instagram-scraper is a command-line application written in Python that scrapes and downloads an instagram user's photos and videos. Use responsibly.
|
||||
* [Jackett](https://github.com/Jackett/Jackett): API support for torrent trackers (works with Sonarr, Radarr and others)
|
||||
* [Sick-Beard](https://github.com/midgetspy/Sick-Beard): PVR for newsgroup users (with limited torrent support)
|
||||
* [Lidarr](https://github.com/lidarr/Lidarr): Music collection manager for Usenet and BitTorrent users
|
||||
* [Mylar](https://github.com/evilhero/mylar): An automated Comic Book downloader (cbr/cbz) for use with SABnzbd, NZBGet and torrents
|
||||
* [bazarr](https://github.com/morpheus65535/bazarr): Companion application to Sonarr and Radarr for downloading subtitles
|
||||
* [RipMe](https://github.com/RipMeApp/ripme): RipMe is an album ripper for various websites. Runs on your computer. Requires Java 8.
|
||||
* [Instagram Scraper](https://github.com/dankmemes/instagram-scraper): Instagram-scraper is a command-line application written in Python that scrapes and downloads an instagram user's photos and videos. Use responsibly.
|
||||
* [gallery-dl](https://github.com/mikf/gallery-dl): Fownload image galleries and collections from pixiv, exhentai, danbooru and more
|
||||
* [FlexGet](https://github.com/Flexget/Flexget): Multipurpose automation tool for content like torrents, nzbs, podcasts, comics, series, movies, etc
|
||||
* [Plowshare](https://github.com/mcrapet/plowshare): Command-line tool to manage file-sharing site
|
||||
* [PyInstaLive](https://github.com/notcammy/PyInstaLive): Instagram live stream downloader.
|
||||
* [Radarr](https://github.com/Radarr/Radarr): A fork of Sonarr to work with movies à la Couchpotato
|
||||
* [Rclone](https://github.com/ncw/rclone): A command line program to sync files and directories to and from various cloud storage providers
|
||||
* [RedditDownloader](https://github.com/shadowmoose/RedditDownloader): Scrapes Reddit to download media of your choice
|
||||
* [HTTrack](https://www.httrack.com/): Download a website from the Internet to a local directory
|
||||
* [Heritrix](https://github.com/internetarchive/heritrix3): Extensible, web-scale, archival-quality web crawler
|
||||
* [RipMe](https://github.com/RipMeApp/ripme): RipMe is an album ripper for various websites. Runs on your computer. Requires Java 8.
|
||||
* [Sick-Beard](https://github.com/midgetspy/Sick-Beard): PVR for newsgroup users (with limited torrent support)
|
||||
* [Sonarr](https://github.com/Sonarr/Sonarr): PVR for Usenet and BitTorrent users
|
||||
* [wail](https://github.com/machawk1/wail): Web Archiving Integration Layer: One-Click User Instigated Preservation
|
||||
* [Collect](https://github.com/xarantolus/Collect): A server to collect & archive websites that also supports video downloads
|
||||
* [Plowshare](https://github.com/mcrapet/plowshare): Command-line tool to manage file-sharing sites
|
||||
* [wget](https://savannah.gnu.org/git/?group=wget): Utility for non-interactive download of files from
|
||||
* [wikiteam](https://github.com/WikiTeam/wikiteam): set of tools for archiving wikis
|
||||
* [you-get](https://github.com/soimort/you-get): Dumb downloader that scrapes the web
|
||||
* [Youtube-DL](https://github.com/rg3/youtube-dl): A command-line program to download videos from YouTube and a few hundred more sites
|
||||
* [Youtube-MA](https://github.com/CorentinB/YouTube-MA): Youtube metadata archiver
|
||||
* [yt-mango](https://github.com/terorie/yt-mango): Youtube metadata archiver
|
||||
the Web (HTTP & FTP)
|
||||
|
||||
### Compression
|
||||
* [KGB Archiver](https://github.com/RandallFlagg/kgbarchiver): compression tool with unbelievable high compression rate
|
||||
@ -49,9 +51,9 @@ the Web (HTTP & FTP)
|
||||
* [NetLimiter](https://www.netlimiter.com/): Internet traffic control and monitoring tool for Windows
|
||||
|
||||
### File systems
|
||||
* [NTFS drivers for MacOS](https://www.seagate.com/ca/en/support/downloads/item/ntfs-driver-for-mac-os-master-dl/)
|
||||
* [httpdirfs](https://github.com/fangfufu/httpdirfs/): A filesystem which allows you to mount HTTP directory listings
|
||||
* [mergerfs](https://github.com/trapexit/mergerfs): a featureful union filesystem
|
||||
* [NTFS drivers for MacOS](https://www.seagate.com/ca/en/support/downloads/item/ntfs-driver-for-mac-os-master-dl/)
|
||||
|
||||
### File conversion
|
||||
* [AAXtoMP3](https://github.com/KrumpetPirate/AAXtoMP3): convert AAX files to common MP3, M4A, M4B, flac and ogg formats through a basic bash script frontend to FFMPEG
|
||||
@ -59,43 +61,43 @@ the Web (HTTP & FTP)
|
||||
|
||||
|
||||
### Utility Scripts
|
||||
* [rclone_dirsize](https://gist.github.com/simon987/7aff5ca3e9ae6c755055ca7b350ef9f8): Get size of http directory listing with rclone
|
||||
* [youtube-dl_soundcloud](https://gist.github.com/simon987/2dd7c57d65a741c93f5791bc984b97d1): snippet for using youtube-dl to download soundcloud playlists
|
||||
* [rm_empty_subdir](https://gist.github.com/simon987/f5c2cd7602898615ac9bc8c762d9fe1d): Remove empty sub-directories on Windows
|
||||
* [void-cat-uploader](https://github.com/takky1154/void-cat-uploader): This script automatically uploads all files inside a directory to https://void.cat.
|
||||
* [Backblaze B2 sync backup script](https://gist.github.com/AlexanderProd/cb645cf858fd5c89780e7df267226b80): Script to sync mutliple directories with Backblaze B2
|
||||
* [Misc download scripts](https://github.com/simon987/Misc-Download-Scripts): Scripts for downloading content from various websites
|
||||
* [rclone_dirsize](https://gist.github.com/simon987/7aff5ca3e9ae6c755055ca7b350ef9f8): Get size of http directory listing with rclone
|
||||
* [rm_empty_subdir](https://gist.github.com/simon987/f5c2cd7602898615ac9bc8c762d9fe1d): Remove empty sub-directories on Windows
|
||||
* [void-cat-uploader](https://github.com/takky1154/void-cat-uploader): This script automatically uploads all files inside a directory to https://void.cat.
|
||||
* [youtube-dl_soundcloud](https://gist.github.com/simon987/2dd7c57d65a741c93f5791bc984b97d1): snippet for using youtube-dl to download soundcloud playlists
|
||||
|
||||
### Content sharing
|
||||
* [opds](https://opds.io/): Easy to use, Open & Decentralized Content Distribution
|
||||
* [ipfs](https://ipfs.io/): Protocol and network designed to create a content-addressable, peer-to-peer method of storing and sharing hypermedia in a distributed file system
|
||||
* [h5ai](https://github.com/lrsjng/h5ai): HTTP web server index for Apache httpd, lighttpd, nginx and Cherokee
|
||||
* [ipfs](https://ipfs.io/): Protocol and network designed to create a content-addressable, peer-to-peer method of storing and sharing hypermedia in a distributed file system
|
||||
* [opds](https://opds.io/): Easy to use, Open & Decentralized Content Distribution
|
||||
|
||||
### Data curation
|
||||
* [baobab](https://github.com/GNOME/baobab): Graphical disk usage analyzer
|
||||
* [beets](https://github.com/beetbox/beets): Music library manager and MusicBrainz tagger
|
||||
* [Calibre](https://github.com/kovidgoyal/calibre): Ebook manager
|
||||
* [DeepSort](https://github.com/CorentinB/DeepSort/): AI powered image tagger backed by DeepDetect
|
||||
* [diskover](https://github.com/shirosaidev/diskover): File system crawler, disk space usage, file search engine and file system analytics powered by Elasticsearch
|
||||
* [fucking-weeb](https://github.com/cosarara/fucking-weeb): A library manager for animu (and TV shows, and whatever).
|
||||
* [Everything](https://www.voidtools.com/): Locate files and folders by name instantly (Windows)
|
||||
* [beets](https://github.com/beetbox/beets): Music library manager and MusicBrainz tagger
|
||||
* [picard](https://github.com/metabrainz/picard): MusicBrainz tagger
|
||||
* [Calibre](https://github.com/kovidgoyal/calibre): Ebook manager
|
||||
* [WinDirStat](https://windirstat.net/): Disk usage statistics viewer and cleanup tool for Windows
|
||||
* [jdupes](https://github.com/jbruchon/jdupes): Powerful duplicate file finder
|
||||
* [Mp3tag](https://www.mp3tag.de): Powerful and easy-to-use tool to edit metadata of audio files (Windows/Mac)
|
||||
* [FileBot](https://www.filebot.net/): FileBot is the ultimate tool for organizing and renaming your Movies, TV Shows and Anime
|
||||
* [MediaInfo](https://github.com/MediaArea/MediaInfo): Convenient unified display of the most relevant technical and tag data for video and audio files
|
||||
* [tree](http://mama.indstate.edu/users/ice/tree/): 'tree' command for linux
|
||||
* [fucking-weeb](https://github.com/cosarara/fucking-weeb): A library manager for animu (and TV shows, and whatever).
|
||||
* [grepWin](https://github.com/stefankueng/grepWin): A powerful and fast search tool using regular expressions (Windows)
|
||||
* [TeraCopy](https://www.codesector.com/downloads): Copy your files faster and more securely
|
||||
* [baobab](https://github.com/GNOME/baobab): Graphical disk usage analyzer
|
||||
* [phockup](https://github.com/ivandokov/phockup): Media sorting tool to organize photos and videos from your camera
|
||||
* [jdupes](https://github.com/jbruchon/jdupes): Powerful duplicate file finder
|
||||
* [MediaElch](https://github.com/komet/mediaelch): Media manager for Kodi
|
||||
* [MediaInfo](https://github.com/MediaArea/MediaInfo): Convenient unified display of the most relevant technical and tag data for video and audio files
|
||||
* [Mp3tag](https://www.mp3tag.de): Powerful and easy-to-use tool to edit metadata of audio files (Windows/Mac)
|
||||
* [phockup](https://github.com/ivandokov/phockup): Media sorting tool to organize photos and videos from your camera
|
||||
* [picard](https://github.com/metabrainz/picard): MusicBrainz tagger
|
||||
* [TeraCopy](https://www.codesector.com/downloads): Copy your files faster and more securely
|
||||
* [tree](http://mama.indstate.edu/users/ice/tree/): 'tree' command for linux
|
||||
* [WinDirStat](https://windirstat.net/): Disk usage statistics viewer and cleanup tool for Windows
|
||||
|
||||
### APIs & Online tools
|
||||
* [thetvdb](https://www.thetvdb.com/): TV shows metadata (used by plex)
|
||||
* [iqdb](https://iqdb.org/): Multi-service reverse image search
|
||||
* [thetvdb](https://www.thetvdb.com/): TV shows metadata (used by plex)
|
||||
|
||||
### Hardware / Monitoring
|
||||
* [CrystalDiskInfo](https://crystalmark.info/en/software/crystaldiskinfo/): A HDD/SSD utility software which supports a part of USB, Intel RAID and NVMe.
|
||||
* [smartmontools](https://www.smartmontools.org/): Control and monitor storage systems using the (SMART) built into most modern ATA/SATA, SCSI/SAS and NVMe disks
|
||||
* [Hard Drive Sentinel](https://www.hdsentinel.com/): Multi-OS SSD and HDD monitoring and analysis software
|
||||
* [smartmontools](https://www.smartmontools.org/): Control and monitor storage systems using the (SMART) built into most modern ATA/SATA, SCSI/SAS and NVMe disks
|
||||
|
Loading…
x
Reference in New Issue
Block a user