red2awn • yesterday at 7:37 PM

The "distillation attacks" are mostly using Claude as an LLM-as-a-judge. They are not training on the reasoning chains in an SFT fashion.

Replies

zozbot234 • yesterday at 7:45 PM

So they're paying expensive input tokens to extract at best a tiny amount of information ("judgment") per request? That's even less like "distillation" than the other claim of them trying to figure out reasoning by asking the model to think step by step.
