logoalt Hacker News

janalsncmtoday at 7:08 PM1 replyview on HN

(For reference I’m talking about the DFT post from the same blog.) I love that ML is still in the “gentleman researcher” stage where relatively small amounts of startup capital can buy a ticket into frontier research.

For a lot of research questions 6 GPUs is even overkill.

It’s one of the reasons I’m skeptical of the “trillion dollar supercluster” idea [0]. I think what we need is more reasonably smart people investigating medium-sized problems. A “GPU middle class” you might say.

[0] https://situational-awareness.ai/racing-to-the-trillion-doll...


Replies

rosminetoday at 8:35 PM

I agree :) Also, I heard Teknium trained the original Hermes model on 2x 4090. You can do a lot with a little compute