logoalt Hacker News

jchengyesterday at 5:45 PM0 repliesview on HN

Here are some examples of the questions in the benchmark. If these are representative, they seem pretty cut and dry. https://artificialanalysis.ai/evaluations/omniscience#exampl...