What do TPUs do to improve on GPUs at inference?

More compute