You're right. It does seem like a suboptimal format in terms of memory usage efficiency
The tokens all have int IDs, this is just how they're rendered.
The tokens all have int IDs, this is just how they're rendered.