Where are you getting those benchmark figures from? Math-500 should be closer to 98% for both models... | alt Hacker News

alt Hacker News

yorwba • today at 12:01 PM • 0 replies • view on HN

Where are you getting those benchmark figures from? Math-500 should be closer to 98% for both models: https://artificialanalysis.ai/evaluations/math-500?models=de...