logoalt Hacker News

CS336: Language Modeling from Scratch

143 pointsby kristianpaultoday at 2:10 PM12 commentsview on HN

Comments

mekentoday at 3:38 PM

I have fond memories of cs224d [1] taught by richardsocher. It’s a bit dated at this point as it was created in the pre-transformer era, but it was a very cool introduction to applying deep learning to nlp at the time.

[1] https://cs224d.stanford.edu

show 1 reply
skerittoday at 4:07 PM

> GPU compute for self-study

Those suggestions they make for a B200 start at $4.99 an hour.

Is that really required, for starting out? I've been tinkering with my own from-scratch LLM, but in the early phases I don't need anything more than a 4090 on Vast.ai

show 3 replies
sonabinutoday at 4:57 PM

I brought a group together to do this class using the YouTube videos and course materials available online. It is challenging but rewarding. We tackled it one lecture video per week. Started with over 30 learners and by last session we were down to 8.

airstriketoday at 4:25 PM

I wonder if people prefer to learn this on their own or if building a community around open learning is something that others are interested in

show 1 reply
storustoday at 3:24 PM

Thanks for releasing this again! What are this year's changes to prior offerings?

tmuletoday at 3:39 PM

Are video lectures available online?

show 4 replies