If you combine the LLM probability distribution with arithmetic coding you can actually use them to ...

frotaur • today at 9:29 AM • 0 replies • view on HN

If you combine the LLM probability distribution with arithmetic coding you can actually use them to compress text losslessly. When people reports 'bits per byte', it is actually the compression rate for text.

GPT-2 for instance achieves roughly 1 bit per byte, so it can be used to compress (english) text 8-fold. Modern models are likely much better.

alt Hacker News