> I expect that a hypothetical advance in fabrication enabling processing elements to be placed directly adjacent to dense RAM on the same silicon (not merely in the same package) would be superior in all regards.
Logic scales better than DRAM does, so tying both to the same die seems like the wrong tradeoff. I think an HBM-like stack where the bottom layer holds the math units is probably the ultimate form of that idea.
And it's possible that flash instead of DRAM is actually the better play, as long as you can hook up enough of it in parallel. RIP Optane.
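To put a rough shape on "enough in parallel": a back-of-envelope sketch, assuming bandwidth adds up perfectly linearly across dies (it won't quite, in practice). The function name and the numbers in the example call are made-up placeholders, not measurements of any real part.

```python
import math


def dies_needed(target_gb_per_s: float, per_die_gb_per_s: float) -> int:
    """Smallest die count whose combined bandwidth meets the target.

    Assumes ideal linear scaling: no controller, channel, or
    interconnect bottleneck. Hypothetical helper for illustration.
    """
    if target_gb_per_s <= 0 or per_die_gb_per_s <= 0:
        raise ValueError("bandwidths must be positive")
    return math.ceil(target_gb_per_s / per_die_gb_per_s)


if __name__ == "__main__":
    # Placeholder figures only: a 1000 GB/s target and 2 GB/s per die.
    print(dies_needed(1000, 2))
```

The point is just that the required die count grows inversely with per-die bandwidth, so the slower the individual part, the more the whole idea leans on how wide you can go.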