A tool that removes censorship from open-weight LLMs

130 points • by mvdwoord • yesterday at 2:27 PM • 57 comments • view on HN

Comments

    You're not just using a tool — you're co-authoring the science.

This README is an absolute headache that is filled with AI writing, terminology that doesn't exist or is being used improperly, and unsound ideas. For example, it focuses a lot on doing "ablation studies", by which it means removing random layers of an already-trained model, to find the source of the refusals(?), which is an absolute fool's errand because such behavior is trained into the model as a whole and would not be found in any particular layer. I can only assume somebody vibe-coded this and spent way too much time being told "You're absolutely right!" bouncing back the worst ideas

➕ show 10 replies

g947o • today at 2:59 AM

Went through the README but still have no idea how well this works, in terms of removing the censorship while minimally degrading the quality of responses. Well to be honest I can't tell if this works at all or is just an idea.

ComputerGuru • yesterday at 7:22 PM

Reviews of the tool on twitter indicate that it completely nerfs the models in the process. It won't refuse, but it generates absolutely stupid responses instead.

➕ show 8 replies

Alifatisk • yesterday at 7:20 PM

This is for local models right? I can't use it on, say my glm-5 subscription connected to opencode?

➕ show 1 reply

PeterStuer • yesterday at 8:39 PM

Already censored for sharing on FB Messenger?

littlestymaar • yesterday at 7:35 PM

Don't use this 2 days old vibe coded bullshit please.

p-e-w's Heretic (https://news.ycombinator.com/item?id=45945587) is what you're looking for if you're looking for an automatic de-censoring solution.

ftkftk • yesterday at 9:50 PM

Didn't make it past the first paragraph of AI slop in the README. Have some respect for your readers and put actual information in it, ideally human generated. At least the first paragraph! Otherwise you may as well name it IGNOREME.

SilverElfin • yesterday at 10:20 PM

Does anyone offer a live (paid) LLM chatbot / video generation / etc that is completely uncensored? Like not requiring doing any work except just paying for it?

➕ show 2 replies

measurablefunc • yesterday at 8:01 PM

This is another instance of avant-garde "art".

aplomb1026 • today at 12:32 AM

[dead]

greenpizza13 • yesterday at 6:14 PM

Never stopped to ask if they should...

alt Hacker News

A tool that removes censorship from open-weight LLMs

Comments