Worth looking at | alt Hacker News

andy99 • yesterday at 4:39 PM • 1 reply

Worth looking at https://www.anthropic.com/engineering/a-postmortem-of-three-...

They can “go insane,” but it often seems to be infra-related rather than anything one would consider hallucination. Smaller models will often get stuck repeating a word or phrase forever, but that’s a bit different, and nobody would call it hallucination.

Replies

tadfisher • yesterday at 6:53 PM

When you can reliably prompt these things into insanity, then it's demonstrably not an infrastructure issue.
