stalfie • today at 8:24 AM • 0 replies

This article describes how Transformers work, but not really how LLMs work. Explaining the underlying architecture gives you about as much insight into how a modern LLM behaves as a breakdown of neuronal biochemistry and a few pathways gives you into the brain. Meaning, almost no insight at all.
