I just found out the other day that the official Internet Archive/Wayback Machin...

ajdude · on Oct 12, 2023

I wish I knew about this sooner! They also have it for firefox https://addons.mozilla.org/en-US/firefox/addon/wayback-machi...

willsmith72 · on Oct 12, 2023

I see they have a private mode, how does this work exactly? How does it make sure it's archiving the home page of my bank's website and not my personal dashboard?

Kye · on Oct 12, 2023

It probably nudges the crawler to come rather than sending the page. If it's private, the crawler won't be able to see it.

tivert · on Oct 13, 2023

> It probably nudges the crawler to come rather than sending the page. If it's private, the crawler won't be able to see it.

I'm sure that's exactly what it does. It is definitely not archiving some Facebook pages I've been browsing (that I would actually like to archive). It's only been capturing the login screen or an error about abusing the site.

guraf · on Oct 13, 2023

It doesn't share your cookies, that probably stops almost any leak. Then probably rules to filter out legacy ?sessid= ?sID=.

Other than that I imagine it leaks things like /bank/account/id/75795 but the crawler will drop it when it returns a 40X.

quickthrower2 · on Oct 13, 2023

Or things like private shares to Dropbox, Zoom, YT etc.? Password resets? One time login links? Etc.