Hacker News

A tool that removes censorship from open-weight LLMs

130 points by mvdwoord yesterday at 2:27 PM | 57 comments

Comments

a2128 yesterday at 7:49 PM

    You're not just using a tool — you're co-authoring the science.
This README is an absolute headache that is filled with AI writing, terminology that doesn't exist or is being used improperly, and unsound ideas. For example, it focuses a lot on doing "ablation studies", by which it means removing random layers of an already-trained model, to find the source of the refusals(?), which is an absolute fool's errand because such behavior is trained into the model as a whole and would not be found in any particular layer. I can only assume somebody vibe-coded this and spent way too much time being told "You're absolutely right!" bouncing back the worst ideas

g947o today at 2:59 AM

Went through the README but still have no idea how well this works, in terms of removing the censorship while minimally degrading the quality of responses. Well, to be honest, I can't tell if this works at all or is just an idea.

ComputerGuru yesterday at 7:22 PM

Reviews of the tool on Twitter indicate that it completely nerfs the models in the process. It won't refuse, but it generates absolutely stupid responses instead.

Alifatisk yesterday at 7:20 PM

This is for local models, right? I can't use it on, say, my glm-5 subscription connected to opencode?

PeterStuer yesterday at 8:39 PM

Already censored for sharing on FB Messenger?

littlestymaar yesterday at 7:35 PM

Don't use this 2-day-old vibe-coded bullshit, please.

p-e-w's Heretic (https://news.ycombinator.com/item?id=45945587) is what you're looking for if you're looking for an automatic de-censoring solution.

ftkftk yesterday at 9:50 PM

Didn't make it past the first paragraph of AI slop in the README. Have some respect for your readers and put actual information in it, ideally human generated. At least the first paragraph! Otherwise you may as well name it IGNOREME.

SilverElfin yesterday at 10:20 PM

Does anyone offer a live (paid) LLM chatbot / video generation / etc. that is completely uncensored? Like, not requiring any work except just paying for it?

measurablefunc yesterday at 8:01 PM

This is another instance of avant-garde "art".

aplomb1026 today at 12:32 AM

[dead]

greenpizza13 yesterday at 6:14 PM

Never stopped to ask if they should...