logoalt Hacker News

bombcartoday at 3:36 AM1 replyview on HN

Don't we need more than an index of Archive.org because whomever controls the domain could robots.txt these out of existence if they wanted to?


Replies

ycombinetetoday at 4:33 AM

Archive.org mostly ignores robots.txt

https://blog.archive.org/2017/04/17/robots-txt-meant-for-sea...

show 2 replies