Word boundaries are a complex thing, especially in languages like Chinese or Japanese. Whitespace and punctuation are much less complicated, even if we take the full Unicode case. So the boundary where formatting is considered is between (whitespace | punctuation) and anything else.
So now you have to distribute a character class table in every implementation of your language, which is precisely what the author of Djot wanted to avoid.