efficiency per token has tanked but it's still faster. given this is the first generation for Cerberas hardware this is the worst it's ever going to be.

when it reaches the main 5.3 codex efficiency at this token rate this kind of articles will seem silly in retrospect

Yeah, the progress is still incredibly impressive even if 15× is overstated. Curious to see how far it goes in the future.

[deleted]