logoalt Hacker News

rayineryesterday at 6:25 PM8 repliesview on HN

Oh my god did we inadvertently train AIs on idiotspeak.


Replies

canucker2016yesterday at 8:38 PM

It seems to be called the Rule of 3. see https://en.wikipedia.org/wiki/Rule_of_three_(writing)

Like Caesar's supposed "Veni Vidi Vici" saying, people seem to prefer and remember items when grouped in three.

I recall a public speaking film shown to my management science class starring John Cleese mentioning this rule of 3.

Exoristostoday at 12:34 AM

There was nothing inadvertent about it. A decade of cultivating and harvesting millions of examples of this kind of pseudo-writing from underpaid internet piece-workers preceded LLMs.

Sharlinyesterday at 8:40 PM

Given that this specific style is the result of being reinforced over and over again via RLHF, "inadvertently" isn't really the word I'd use.

LearnYouALispyesterday at 6:35 PM

In-advert-ently?

12_throw_awayyesterday at 7:54 PM

> did we inadvertently train AIs on idiotspeak.

Nope! That is - training on lowest-common-denominator, low-signal high-noise "idiotspeak" was not at all inadvertent.

BoredPositronyesterday at 8:10 PM

It's engaging and I doubt it happened by accident.

andrepdyesterday at 10:14 PM

No, we actually trained it on standardised tests https://marcusolang.substack.com/p/im-kenyan-i-dont-write-li...

slowmovintargetyesterday at 9:44 PM

* Checks notes *

Reddit

Twitter

Facebook

4chan

Call of Duty chat logs

Every public marketing site

SlashDot

UseNet

...

Verdict: Yes idiotspeak was part of the training set, but no, it was not inadvertent. There's a smattering of Shakespeare in there, at least.