
I just found out the other day that the official Internet Archive/Wayback Machine extension (https://chrome.google.com/webstore/detail/wayback-machine/fp...) has a mode that automatically archives every page you view. It's pretty neat, and I wish more people would run it so that a more diverse snapshot of the web gets into the Wayback Machine.



I wish I had known about this sooner! They also have it for Firefox: https://addons.mozilla.org/en-US/firefox/addon/wayback-machi...


I see they have a private mode. How does this work exactly? How does it make sure it's archiving the home page of my bank's website and not my personal dashboard?


It probably nudges the crawler to come rather than sending the page. If it's private, the crawler won't be able to see it.
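To make the "nudge" idea concrete, here is a minimal sketch, assuming the extension does nothing more than hand the page's address to the public Save Page Now endpoint at https://web.archive.org/save/. Whether the extension really works this way is a guess; `build_nudge` is a hypothetical helper, not part of any Internet Archive library.

```python
SAVE_ENDPOINT = "https://web.archive.org/save/"


def build_nudge(page_url: str) -> dict:
    """Describe the request a browser could send to ask for a crawl.

    Assumption: only the address is transmitted. No cookies and no page
    contents leave the browser, so the crawler has to fetch the page on
    its own, as an anonymous visitor.
    """
    return {
        "method": "GET",
        "url": SAVE_ENDPOINT + page_url,
        "cookies": {},  # nothing from the user's session is attached
        "body": None,   # the rendered page is never uploaded
    }


request = build_nudge("https://example.com/news")
print(request["url"])
```

Under that assumption, a page behind a login would be captured as whatever a logged-out visitor gets, such as a login screen.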


> It probably nudges the crawler to come rather than sending the page. If it's private, the crawler won't be able to see it.

I'm sure that's exactly what it does. It is definitely not archiving some Facebook pages I've been browsing (that I would actually like to archive). It's only been capturing the login screen or an error about abusing the site.


It doesn't share your cookies, which probably stops almost any leak. On top of that, there are probably rules to filter out legacy session parameters like ?sessid= and ?sID=.

Other than that, I imagine it leaks things like /bank/account/id/75795, but the crawler will drop the capture when the request returns a 40X.
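A rough sketch of what such filtering could look like, purely as an illustration: the parameter list, `strip_session_params`, and `should_keep_capture` are all hypothetical, and nothing in this thread confirms what rules the extension or the crawler actually apply.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical blocklist, compared case-insensitively (covers ?sessid= and ?sID=).
SESSION_PARAMS = {"sessid", "sid"}


def strip_session_params(url: str) -> str:
    """Return the URL with session-like query parameters removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in SESSION_PARAMS
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))


def should_keep_capture(status_code: int) -> bool:
    """Drop captures where the cookie-less fetch got a 4xx response."""
    return not (400 <= status_code < 500)


print(strip_session_params("https://example.com/page?sessid=abc&q=1"))
print(should_keep_capture(403))
```

Note that this does nothing for secrets carried in the path or in an unlisted parameter, which is why the status-code check alone is not a complete safeguard.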


Or things like private share links for Dropbox, Zoom, YT, etc.? Password resets? One-time login links?



