Word boundaries are a complex thing, especially in languages like Chinese or Japanese. Whitespace an...

nine_k • yesterday at 3:40 AM • 1 reply • view on HN

Word boundaries are a complex thing, especially in languages like Chinese or Japanese. Whitespace and punctuation are much less complicated, even if we take the full Unicode case. So the boundary where formatting is considered is between (whitespace | punctuation) and anything else.

Replies

xigoi • yesterday at 6:17 AM

So now you have to distribute a character class table in every implementation of your language, which is precisely what the author of Djot wanted to avoid.

alt Hacker News

Replies