>from biology ... much greater efficiency is possible

Those are much more specialized models with pretty mediocre tokens per second.

Perhaps tokens is a dead end?

Perhaps! But perhaps whatever human brains use instead of tokens is not as amenable to scaling or copying.

[dead]