but they're not. Often the confidence value is much lower. I should have an option to see how confident it is. (Maybe set the opacity of each token to its confidence?)

Logits aren't confidence about facts; they only score how likely each next token is given the text so far. You can turn on a display like this in the OpenAI Playground and you will see it doesn't do what you want.
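
If you want to try the opacity idea yourself, here is a rough sketch. It assumes you already have a list of tokens and their log-probabilities from whatever API you use; the function name `tokens_to_html` and the `min_opacity` floor are made up for illustration. Note that what it visualizes is per-token probability, not whether a statement is true.

```python
import math
from html import escape


def tokens_to_html(tokens, logprobs, min_opacity=0.15):
    """Render tokens as HTML spans whose opacity tracks token probability.

    `tokens` and `logprobs` are assumed inputs (parallel lists); how you
    obtain them depends on the model API. `min_opacity` is a hypothetical
    floor so very unlikely tokens stay readable.
    """
    spans = []
    for tok, lp in zip(tokens, logprobs):
        p = math.exp(lp)  # log-probability -> probability in [0, 1]
        opacity = min_opacity + (1.0 - min_opacity) * p
        spans.append(
            f'<span style="opacity:{opacity:.2f}">{escape(tok)}</span>'
        )
    return "".join(spans)
```

A confidently wrong answer will still render fully opaque, because a high probability for the next token says nothing about whether the claim is correct.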