>from biology ... much greater efficiency is possible
Those are much more specialized models with pretty mediocre tokens per second.
Perhaps tokens is a dead end?
[dead]
Perhaps tokens is a dead end?