I don't think there is a fundamental divide between implementation-level speedups and algorithmic/architectural optimizations.
A speedup that changes nothing else is just that: a speedup that changes nothing else.