Low latency is nice. But it would be more interesting if they could demonstrate the efficiency of energy consumption.
Tokens/seconds and watt-hours seem related?
Tokens/seconds and watt-hours seem related?