Would archive.org typically honor a robots.txt for a resource it already retriev...

mikeash · on Aug 11, 2015

Apparently yes, it would: https://archive.org/about/exclude.php

syncsynchalt · on Aug 11, 2015

My understanding is that sites like archive.org honor robots.txt retroactively not because they are required to, but to best respect the wishes of the content provider.

X-Istence · on Aug 11, 2015

Yes, it simply hides the content. The content is still kept in their database, so if the robots.txt disappears, it pops back up in their archive.

New pages won't be archived while the robots.txt exclusion is in place, though.
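
To make the hide-but-keep behavior concrete, here is a rough sketch of how it could work. This is a toy model, not archive.org's actual code: the `ia_archiver` agent name, the `is_visible` and `visible_snapshots` helpers, and the in-memory `snapshots` dict are all assumptions made up for illustration. The only real API used is Python's standard `urllib.robotparser`.

```python
from urllib.robotparser import RobotFileParser

# Assumed crawler name, used here only for illustration.
ARCHIVE_AGENT = "ia_archiver"


def is_visible(robots_txt, url, agent=ARCHIVE_AGENT):
    """Return True if the site's *current* robots.txt lets `agent` access `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


def visible_snapshots(snapshots, robots_txt):
    """Filter stored snapshots by the current robots.txt without deleting any."""
    return {
        url: body
        for url, body in snapshots.items()
        if is_visible(robots_txt, url)
    }


# Hypothetical stored copy, retrieved before any exclusion existed.
snapshots = {"http://example.com/page": "<html>old copy</html>"}

# The site later adds an exclusion: the copy is hidden but still stored.
blocking = "User-agent: ia_archiver\nDisallow: /\n"
hidden_view = visible_snapshots(snapshots, blocking)

# The exclusion is removed again: the same stored copy becomes visible.
restored_view = visible_snapshots(snapshots, "")
```

The point of the sketch is that the filter runs at display time against whatever robots.txt the site serves now, so nothing has to be deleted for a page to disappear, and nothing has to be re-crawled for it to come back.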