logoalt Hacker News

com2kidyesterday at 11:15 PM0 repliesview on HN

The other side of this is how powerful small and medium parameter models are.

24b param models today are way more powerful than 24b param models 2 years ago.