Hacker News • bigyabai • yesterday at 9:34 PM
You won't be caching much of anything in RAM with experts that are 220B parameters' worth of layers.
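For a rough sense of scale, here is a back-of-the-envelope sketch of how much memory 220B parameters of weights would take. The precisions (16-, 8-, and 4-bit) and the helper name `weight_footprint_gb` are assumptions for illustration, not something the comment specifies, and the estimate ignores KV cache, activations, and quantization overhead.

```python
def weight_footprint_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate size of the raw weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Parameter count taken from the comment; everything else is assumed.
EXPERT_PARAMS = 220e9

for bits in (16, 8, 4):  # assumed storage precisions
    size = weight_footprint_gb(EXPERT_PARAMS, bits)
    print(f"{bits}-bit weights: ~{size:.0f} GB")
```

Even at an assumed 4-bit quantization that comes to roughly 110 GB of weights, which is more RAM than most desktop machines have.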