logoalt Hacker News

procaryoteyesterday at 7:52 PM0 repliesview on HN

also, the redundancy means that you get a pretty good heuristic for "is this utf-8". Random data or other encodings are pretty unlikely to also be valid utf-8, at least for non-tiny strings