Hacker News

thegrim33, yesterday at 8:14 PM

Whether or not the linked tool uses a good approach, manipulating models the way you mention is already fairly well established. See: https://huggingface.co/blog/mlabonne/abliteration