I've also done a full network replica (all the data indexed in a postgres) on a raspberry pi (with an 8tb nvme attached via a hat). Its really not expensive to do . And if I wanted to drop data older than say 3 months, it would be even cheaper still.
Case in point! This is an often mentioned statement on which the argument that "atproto is no decentralized" largely hinges. There are honest atproto digs out there but that is not one.