Appreciate that! Results are live: https://gertlabs.com/rankings
Opus 4.8 is the first tangible improvement since Opus 4.5. And it doesn't seem to have the personality problems of the last release -- I've been enjoying using it.
Nice! Looks like it’s topping the two coding ones. I noticed it is absent from the Social Intelligence board though?
Nice! Looks like it’s topping the two coding ones. I noticed it is absent from the Social Intelligence board though?