Two things. First, the model and runtime matter a lot; smaller/quantized models are basically useless at strict instruction following compared to SOTA models. Second, "never do X" doesn't work that well: if you want the model to never do X, you need to adjust the harness and/or steer it with "positive prompting" instead. As a silly example, don't write "Never use uppercase"; write "Always use lowercase only" and you'll get much better results. If you've trained dogs before ("positive reinforcement training"), this will come easier to you.
It's interesting to note that Anthropic indeed doesn't use "do not X" in the Opus system prompts. "Claude does not X", however, is very common.
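To make both ideas concrete, here's a minimal Python sketch. The complete() function is a hypothetical stand-in for whatever model/runtime you're actually calling, not a real API; the point is the positive phrasing plus the harness-side guard that enforces the rule regardless of what the model does:

    # Hypothetical stand-in for an LLM call; swap in your real client.
    def complete(system_prompt: str, user_msg: str) -> str:
        return "OK, Noted!"  # pretend this is the model's raw output

    # Negative phrasing: models tend to ignore or even fixate on "never".
    NEGATIVE = "Never use uppercase."

    # Positive phrasing: describe the behavior you DO want.
    POSITIVE = "Always use lowercase only."

    def harness_guard(text: str) -> str:
        # Harness-side enforcement: don't rely on the prompt alone,
        # normalize the output after the fact so the rule always holds.
        return text.lower()

    reply = complete(POSITIVE, "Acknowledge this message.")
    print(harness_guard(reply))  # -> "ok, noted!"

The guard is the "adjust the harness" half: even a model that slips on the instruction can't violate the rule once the output passes through it.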