Bricken isn’t just making this up. He’s one of the leading researchers in model interpretability. Se...

rbranson • last Friday at 1:24 PM • 0 replies • view on HN

Bricken isn’t just making this up. He’s one of the leading researchers in model interpretability. See: https://arxiv.org/abs/2411.14257

alt Hacker News