> GPU compute for self-study Those suggestions they make for a B200 start at $4.99 an hour. I...

skerit • today at 4:07 PM • 5 replies • view on HN

> GPU compute for self-study

Those suggestions they make for a B200 start at $4.99 an hour.

Is that really required, for starting out? I've been tinkering with my own from-scratch LLM, but in the early phases I don't need anything more than a 4090 on Vast.ai

Replies

_0ffh • today at 5:51 PM

You're right to be sceptical. I have trained reasonably good SLMs for the TinyStories dataset on my 4060Ti (16GB) with no problems. You'll only encounter problems if you want to try if your ideas scale up to models any bigger than "arguably tiny".

marcelroed • today at 5:40 PM

TA here. Definitely not! In fact we explicitly added sections in the first assignment to allow for scaling down to even local compute (M-series GPUs). For assignment 2 there are a few regions that require Triton support for your GPU, but everything can be adapted for much cheaper GPUs.

We were lucky enough to get Blackwell GPUs for Stanford students this year, which is why the writeups are written mostly around them.

grahameb • today at 5:23 PM

It seems strange that the required resources aren't provided by the educational institution?

➕ show 2 replies

root-parent • today at 4:34 PM

You dont even need a GPU to train your own LLM.

flakiness • today at 5:02 PM

I beliee these are affordable enough for the intended audience (which is Stanford undergrad/master)

➕ show 1 reply

alt Hacker News

Replies