You're right. It does seem like a suboptimal format in terms of memory usage efficiency

The tokens all have int IDs, this is just how they're rendered.