Set this up a couple weeks using an proxmox lxc script and have it using ollama to create tags. I hadn’t heard of singlefile before. That seems like an excellent pairing.
Thoughts on this vs something like ArchiveBox?
Talking about hoarding, LTO tapes are the king of cheap storage, but if you want to archive significant amounts (hundreds of TB or more), it takes a significant investment to buy a tape library with somewhat recent drive. Too bad there aren't any alternatives - or are there?
Didn't realize Hoarder now supports SingleFile extension. amazing.
Regarding Hoarder - by selfhosting Hoarder , I was able to cancel my $40/year subscription to Pocket. With the money saved - I added $10 of OpenAI's API credits and use gpt-4o-mini for tagging. I don't have a powerful enough GPU to selfhost Ollama on my NAS where I'm hosting Hoarder. But gpt-4o-mini is dirt cheap for these type of use cases.
Worth noting that Linkding (what the author migrated from to Hoarder) also now supports page archiving via headless Chrome + SingleFile and also via manual upload: https://linkding.link/archiving/
Can Hoarder archive a webpage protected by some kind of auth / login?
btw the hoarder project is an active victim of a patent troll[0][1]; the official Firefox extension is currently blocked by dmca[2]. any donations might be helpful.
[0]: https://github.com/hoarder-app/hoarder/commit/b2c795ccb562c0...
[1]: https://www.reddit.com/r/selfhosted/s/CMCPP7cc8i
[2]: https://github.com/hoarder-app/hoarder/issues/899