One measure is the ability to find an optimal solution among many options. That covers everything from solving an integration problem (option = equation, optimal = correct) to life itself (option = life choice, optimal = best QOL).
Statistically, an LLM finds a better solution than a single individual does. But LLM solutions are more similar to each other than individuals’ solutions are, so among many individuals, a few find a better solution than the LLMs do.
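To make that argument concrete, here is a toy simulation. It is only a sketch: it assumes solution quality can be scored as a single number drawn from a normal distribution, and the means and spreads below are made-up illustration values, not measurements of any real LLM or population.

```python
import random
from statistics import mean

def sample_quality(mu, sigma, n, rng):
    """Draw n hypothetical solution-quality scores from a normal distribution."""
    return [rng.gauss(mu, sigma) for _ in range(n)]

rng = random.Random(0)  # fixed seed so the run is repeatable
N = 10_000

# Assumed numbers: humans have a lower average but a wide spread,
# the LLM has a higher average but its answers cluster tightly.
humans = sample_quality(0.0, 1.0, N, rng)
llm = sample_quality(0.5, 0.2, N, rng)

best_llm = max(llm)
humans_above_best_llm = sum(h > best_llm for h in humans)

print("average human score:", round(mean(humans), 3))
print("average LLM score:  ", round(mean(llm), 3))
print("best human score:   ", round(max(humans), 3))
print("best LLM score:     ", round(best_llm, 3))
print("humans beating the best LLM answer:", humans_above_best_llm)
```

Under these assumptions the LLM wins on average, but the wide human spread means some individuals still land above the best LLM answer. If the LLM's spread were as wide as the humans', that gap would disappear, which is the point about uniqueness.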
That’s why I think we need AI that responds more uniquely. That could be fine-tuned local models, a single LLM that’s more influenced by its prompt and supports extremely long prompts well, AI tools for humans, or something else. I’d love to see a local AI with continuous learning that generates complex UX to better interact with the user (more than prompts), but that may be far-fetched…