Anthropic Research going from strength to strength in interpretability. Publicly releasing the code ...

Tossrock • yesterday at 6:56 PM • 0 replies • view on HN

Anthropic Research going from strength to strength in interpretability. Publicly releasing the code so other labs can benefit from it is also a great move - very values aligned, and improves the overall AI safety ecosystem.

alt Hacker News