Hacker News

mort96 yesterday at 8:07 PM

This feels like it should be solvable by introducing a few more marker characters, e.g. one code point meaning "the following text is traditional Chinese", another meaning "the following text is Japanese", etc. It would add even more statefulness to Unicode, but I feel like that ship has already sailed with the U+202D LEFT-TO-RIGHT OVERRIDE and U+202E RIGHT-TO-LEFT OVERRIDE characters...


Replies

fanf2 yesterday at 8:36 PM

Unicode used to have a system of in-band language tags, but it was deprecated: https://www.unicode.org/faq//languagetagging.html
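
For anyone curious what that looked like on the wire, here is a rough Python sketch. It assumes the scheme as I understand it: U+E0001 LANGUAGE TAG followed by tag characters in U+E0020..U+E007E that mirror printable ASCII. The helper names `encode_language_tag` and `strip_tags` are made up for illustration, and this is not something to use in new text since the mechanism is deprecated.

```python
# Sketch only: the deprecated in-band language tag scheme.
LANGUAGE_TAG = "\U000E0001"  # U+E0001 LANGUAGE TAG (deprecated)
TAG_OFFSET = 0xE0000         # tag characters mirror ASCII 0x20..0x7E at this offset


def encode_language_tag(lang: str) -> str:
    """Hypothetical helper: spell a tag such as 'ja' in invisible tag characters."""
    if not lang.isascii() or not lang.isprintable():
        raise ValueError("language tag must be printable ASCII")
    return LANGUAGE_TAG + "".join(chr(TAG_OFFSET + ord(c)) for c in lang)


def strip_tags(text: str) -> str:
    """Hypothetical helper: drop every code point in the tag block U+E0000..U+E007F."""
    return "".join(c for c in text if not 0xE0000 <= ord(c) <= 0xE007F)


tagged = encode_language_tag("ja") + "\u9AA8"  # tag, then the ideograph U+9AA8
print([f"U+{ord(c):04X}" for c in tagged])
```

The tag is just more code points in the stream, so every consumer has to either interpret or skip them, which is the statefulness the parent comment mentions.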

cyberax today at 3:49 AM

There is a way to do it: https://en.wikipedia.org/wiki/Variation_Selectors_(Unicode_b...

However, it's not used widely and has problems with variant-naïve fonts.
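
A small Python sketch of the mechanics, under some assumptions: the base character U+845B followed by U+E0100 (VARIATION SELECTOR-17) is, as far as I know, a registered ideographic variation sequence, but treat that specific pair as illustrative. `strip_variation_selectors` is a hypothetical helper, not a standard library function.

```python
# Sketch only: a variation sequence is a base character plus a selector code point.
BASE = "\u845B"      # CJK ideograph used here as an example base character
VS17 = "\U000E0100"  # VARIATION SELECTOR-17, first of the supplementary selectors

variant = BASE + VS17  # requests a specific glyph variant, if the font supports it


def strip_variation_selectors(text: str) -> str:
    """Hypothetical helper: remove VS1..VS16 (U+FE00..U+FE0F) and VS17..VS256 (U+E0100..U+E01EF)."""
    return "".join(
        c for c in text
        if not (0xFE00 <= ord(c) <= 0xFE0F or 0xE0100 <= ord(c) <= 0xE01EF)
    )


print(len(BASE), len(variant))                     # one code point vs. two
print(variant == BASE)                             # plain comparison sees different strings
print(strip_variation_selectors(variant) == BASE)  # equal once selectors are removed
```

A font without the variant should fall back to the default glyph for the base character, while search and comparison code has to decide whether to ignore the selector.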
