Hacker News

coder543, last Friday at 11:41 AM

Every model release gets accused of that, including the flagship models.


Replies

naasking, last Friday at 12:36 PM

Less so for Gemma-4 because it falls behind Qwen on benchmarks. Tests for benchmaxxing are also strongly suggestive: https://x.com/bnjmn_marie/status/2041540879165403527
