logoalt Hacker News

DrewADesignyesterday at 11:09 AM1 replyview on HN

Feel free to elucidate if you want to add anything to this thread other than vibes.


Replies

electroglyphyesterday at 11:18 AM

after you go from from millions of params to billions+ models start to get weird (depending on training) just look at any number of interpretability research papers. Anthropic has some good ones.

show 3 replies