I used to solely depend on Wayback machine to automate archiving pages. Now, I a...

keyle · 2024-10-16T07:47:00.000000Z

I was just fantasising earlier, daydreaming, about what a distributed warc or similar solution would look like, with peering and user or distributed server archiving. Either by browser plugin submission or passively sending the urls to servers to do the fetching and archiving (removes some of the privacy issues).

I think it's everyone's responsibility to make sure the web gets cached, not one org... and since Google has canned the Google cache.......

nikisweeting · 2024-10-16T14:44:20.000000Z

ArchiveBox v0.8 is adding the beginnings of a content addressable store for P2P sharing! Stay tuned :)

aspenmayer · 2024-10-16T07:58:10.000000Z

Here's some prior art:

https://wiki.archiveteam.org/index.php/Main_Page

Cthulhu_ · 2024-10-16T07:49:51.000000Z

There should be more internet archives, for various reasons, but it doesn't seem like anyone is willing to put in the effort and money involved, let alone the legal headaches.

jfil · 2024-10-16T12:30:18.000000Z

I agree. And I am dismayed that government and academic institutions like to dance around the legal issues of archiving (outsourcing the legal risk to Internet Archive), instead of pushing for legal protections/exemptions for the act of archiving.

razakel · 2024-10-16T12:37:56.000000Z

The UK and Portugal are both doing it for domestically published websites.

yamrzou · 2024-10-17T05:53:49.000000Z

Would you mind sharing your script?

yasser_kaddoura · 2024-10-17T06:12:29.000000Z

https://www.reddit.com/r/qutebrowser/comments/1g1zbel/usersc...

yamrzou · 2024-10-18T10:35:00.000000Z

Thank you. It says: "Sorry, this post was removed by Reddit’s filters."