Microgpt explained interactively

134 points • by growingswe • today at 9:43 AM • 8 comments

Comments

> By the end of training, the model produces names like "kamon", "karai", "anna", and "anton". None of them are copies from the dataset.

Hey, I can see kamon, karai, anna, and anton in the dataset, so it would be worth using some other names: https://raw.githubusercontent.com/karpathy/makemore/988aa59/...
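
A quick way to check a claim like this is to compare the sampled names against the training file. This is only a minimal sketch: it assumes the dataset is a newline-separated text file of names, and `find_copies` and the inline `sample_dataset` are hypothetical stand-ins, not the real file or anything from the article.

```python
def find_copies(generated, dataset_text):
    """Return the generated names that appear verbatim in the dataset text."""
    # Assumption: one name per line; comparison is case-insensitive.
    known = {line.strip().lower() for line in dataset_text.splitlines() if line.strip()}
    return [name for name in generated if name.lower() in known]


# Hypothetical stand-in for the contents of the names file (not the real data).
sample_dataset = "emma\nolivia\nanna\nanton\n"

copies = find_copies(["kamon", "karai", "anna", "anton"], sample_dataset)
print(copies)  # names that are not novel with respect to sample_dataset
```

Running the same function over the full names file would show which of the article's examples are actually novel.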

malnourish • today at 9:44 PM

I read through this entire article. There was some value in it, but I found it to be very "draw the rest of the owl". It read like introductions to conceptual elements or even proper segues had been edited out. That said, I appreciated the interactive components.

windowshopping • today at 9:10 PM

The part that eludes me is how you get from this to the capability to debug arbitrary coding problems. How does statistical inference become reasoning?

For a long time, the answer seemed to be that it doesn't. But now, using Claude Code daily, it seems it does.
