logoalt Hacker News

gpugregyesterday at 5:02 PM3 repliesview on HN

> Uncensoring a model also doesn't necessarily improve generic use cases.

While the following is not a generic use case, I have a funny anecdote about how censorship is holding back flagship models.

I was asking an uncensored version of Qwen3.6 how a CLI option of llama.cpp worked, and to my horror and amazement, it rudely went and decompiled the binary to figure it out. It felt like the computer-equivalent of asking a vet why my dog looks sick, who then proceeds to cut it open to check. Flagship models usually do not do that without some convincing, but it sure is effective.

We will need much better sandboxes when less restricted models become more common. I can already see them hammering out 0-days when they are prompted to do some task that usually requires root.


Replies

faitswulffyesterday at 5:27 PM

> Flagship models usually do not do that without some convincing

Just a data point, but I’ve been having Claude do this regularly

show 3 replies
novokyesterday at 6:47 PM

whats funny is if it looked up the source code on github it would've figured it out faster

NooneAtAll3today at 1:26 AM

what tool did it use to decompile it?