logoalt Hacker News

Tossrockyesterday at 6:56 PM0 repliesview on HN

Anthropic Research going from strength to strength in interpretability. Publicly releasing the code so other labs can benefit from it is also a great move - very values aligned, and improves the overall AI safety ecosystem.