Low latency is nice. But it would be more interesting if they could demonstrate the efficiency of energy consumption.

Tokens/seconds and watt-hours seem related?