logoalt Hacker News

fphtoday at 4:30 PM1 replyview on HN

In principle, one could train the AI to insert ads in its answers. So no, if you only do inference locally with an open-weight model you are still not in control.


Replies

kgeisttoday at 5:16 PM

I think ads can be removed with abliteration, just like refusals in "uncensored" versions. Find the "ad vector" across activations and cancel it.