I'll take a contrarian view and probably get downvoted for it, but many people are adamant about the benefits of indiscriminate archiving, and I just don't see it. Do we have a moral right to keep a copy of everything that's ever been written on the internet, basically just for the sake of it?
Sure, there's a variety of official and quasi-officials resources that should be treated as public record and preserved. And arguably, there are things that rise to the level of a cultural phenomenon and where the benefit of keeping receipts outweighs the jerk factor of never asking for permission and not respecting the wishes of private individuals.
But if it's some family blog from 30 years ago that's been deliberately taken down and lives on archive.org unbeknownst to the original owner? Do we have a right to that? To what end, other than "well, future historians may need it"? A historian won't look at it. A person trying to doxx you or shame you will.
I have also restored the very first articles I wrote in the 90s when I was young <https://vincent.bernat.ch/en/blog/2026-old-web-articles>. At the time, they were not "great," but now I think they have some limited historical values. For example, one of them is about how the national phone operator was billing minutes. The information was easy to find in the past but is pretty scarce now.
I didn't use the wayback machine because it didn't archive everything I needed and because I still had the files on my hard drive, but if I didn't, I would have been happy to recover them.
Archives have not taken a consistent stance on this. Preservation, removal, restricted access, de-indexing, and “right to be forgotten” sit on a spectrum.
Good archival practice has to include judgment, context, and humility.
Sometimes that means preserving. Sometimes it means limiting access. And sometimes it may mean "honoring" a removal request or court order, even if you're just setting a flag.
That's a good point, and AI makes this worse unfortunately. I realized a while ago I spend more time taking pictures than I even do looking at them, and quit worrying so much about saving everything. As they say, all these moments will be lost in time, like tears in rain. On the other hand, it's a memory of an internet that no longer exists, and that I think a lot of people (myself included) miss dearly.
I do suspect a lot of things weren't deliberately taken down so much as just not maintained, though.
I agree, indiscriminate archival is doomed to fail. Spending all of your resources backing things up leaves you with no resources to curate. This is where archive.org goes wrong I think. I think identifying information that is valuable and distilling, reformatting, and republishing it is much more important.
History is not only what "official" resources want you to believe.
Those who want a "right to be forgotten" are really advocating for the "right to rewrite history".
In the pre-digital era, the norm was that content was preserved. Simply because it was printed in newspapers, magazines, books, alsbums, etc.. in thousands (or hundreds of thousands) of copies, so merely by inertia, many copies survived squirreled away in libraries, attics and other storage.
Also, once printed and distributed, it couldn't be taken back and altered since countless original copies existed.
Sure, a lot of it wasn't that important, but only in hindsight of history does it become apparent what was important. So it was possible to go back and research those old unaltered originals.
I fear for history in the digital era. Everything is fleeting, everything is erased and everything can be retroactively altered when the powers-that-be don't like what was said. Thinking centuries ahead, reliable historical records basically stop around 2000-2020 or so.