logoalt Hacker News

bitwizeyesterday at 8:38 PM2 repliesview on HN

> Reminded me of when I had to be extra wordy to meet the 1000 minimum word limit for my high school essays.

A huge AI signal to me is not em dashes, not emoji, not even the "not X, it's Y" construction which oh god I'm falling into the trap right now aren't I.

It's a combination of these factors plus a tendency to fluff out the piece with punchy but vague language, often recapitulating the same points in slightly reworded ways, that sounds like... an eighth grader trying to write an impressive-sounding essay that clears the minimum word limit.

Did the bright sparks who trained these things just crack open the printer paper boxes in their parents' homes filled with their old schoolwork, and feed that into the machine to get it started?


Replies

zbentleyyesterday at 9:55 PM

Another commenter above this proposed a pretty compelling theory for the source of this style: SEO-inflated prose online. If the models were trained on the internet, "higher quality" content needed to be indicated to them during RL somehow. Search engine ranking is an easy-to-obtain metric that's kind of like "quality" if you squint, turn around, and lobotomize yourself. So the AIs have a high likelihood of producing the kinds of content that is rewarded by Google SEO.

show 2 replies
Gigachadtoday at 12:58 AM

Another hint is when the structure and formality of the response doesn’t match the medium. Like when someone sends you a whole article back in DMs along with headings for the sections.

Even though real humans write like that when writing documents, they never did that in informal messaging.