logoalt Hacker News

ChuckMcMyesterday at 5:53 PM1 replyview on HN

I love that book, that said, the point is more subtle than that. Current LLM attention models are limited in their feedback. Adding a form of 'shame' feedback (result is technically correct but morally bad or some such) would help here but I doubt the folks building theses things would choose to do so.


Replies

jerfyesterday at 6:53 PM

From a certain and quite valid point of view, they have no mechanism for feedback at all. Every time you start a conversation you're starting in the same state, modulo the random numbers. At most you have this very, very vague loop in that the conversations for LLM 1.0 will be fed in to the training set for LLM 2.0.

Even "shame" would only apply to the current session and disappear in the next one, or eventually be compacted away.

(Although honorable mention to Gemini's meltdown: https://x.com/AISafetyMemes/status/1953397827662414022 )

show 1 reply