So it's not a human intelligence. The transformer works very differently. We're trying to emulate human intelligence on a very different architecture.
Although, for the most part, what we actually seem to care about is that the job gets done. It's just that all the training data we have is "guy shaped" (linear), not transformer shaped. We haven't actually figured out how to train a transformer yet.