Yeah it's a knowledge benchmark not agentic benchmark.
That's like saying coding benchmarks are about memorizing the language syntax. You have to know what to call when and how. If you get the job done you win.
That's like saying coding benchmarks are about memorizing the language syntax. You have to know what to call when and how. If you get the job done you win.