Sadly, even software-filled TLBs look to be a thing of the past. Apparently a hardware page-table walker is just that much faster? I’m not sure.
Why is that surprising? The trap into kernel mode alone would already take more cycles than dedicated hardware needs for the full page table walk.