logoalt Hacker News

xigoilast Saturday at 4:46 PM1 replyview on HN

If I’m understanding correctly, your rules are based on word boundaries? How do you define a word boundary?


Replies

nine_kyesterday at 3:40 AM

Word boundaries are a complex thing, especially in languages like Chinese or Japanese. Whitespace and punctuation are much less complicated, even if we take the full Unicode case. So the boundary where formatting is considered is between (whitespace | punctuation) and anything else.

show 1 reply