logoalt Hacker News

ineedasernameyesterday at 10:59 PM1 replyview on HN

Language is leaky, it gets just about everywhere. Some LLM goes and spills a bunch of emdashes and subordinate clauses all over a billion folks’ browsers and a bunch of them— especially those that may come into contact with a lot of language for a living— writers, for example— and they soak up a bit of it themselves and smear it all around.

Put another way, search out the great vowel shift. That happened over more time but then again the contact with different speakers wasn’t as constant as every day on the internet. It’s just what happens, how things spread. No different and maybe to a further degree than typical memes.


Replies

ameliaquiningtoday at 1:46 AM

My suspicion is that the causation mostly goes the other way—LLMs write like that for the same reason that many humans do, namely, that it's a cheap trick for sounding smart with limited effort and cognitive capacity. (My guess would be that em-dash usage among human writers is down in the LLM era because people don't want to be accused of being LLMs, though I don't have any data on this.)

Coincidentally I just read a blog post today that explained this in a way I always struggled to: https://www.astralcodexten.com/p/nostalgebraists-hydrogen-ju...