logoalt Hacker News

KaoruAoiShihoyesterday at 5:57 PM0 repliesview on HN

In my testing this model is quite bad and far behind 235b a22b. https://fiction.live/stories/Fiction-liveBench-Sept-12-2025/...