Hacker News new | past | comments | ask | show | jobs | submit login

The Library of Congress does currently archive limited collections of the internet[0]. They have a blog post[1] breaking down the effort, currently it's 8 full time staff with a team of part time members. According to Wikipedia[2], it's built on Heritrix and Wayback which are both developed by the Internet Archive (blog post also mentions "Wayback software"). Current archives are available at: http://webarchive.loc.gov/

[0] https://www.loc.gov/programs/web-archiving/about-this-progra...

[1] https://blogs.loc.gov/thesignal/2023/08/the-web-archiving-te...

[2] https://en.m.wikipedia.org/wiki/List_of_Web_archiving_initia...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: