Opus 4.5 was definitely stronger than DeepSeek V4 for me, specifically with large context.
I’m being pedantic/splitting hairs, though. I’ve obviously switched to DeepSeek full-time because it makes more sense to me pragmatically — I spend a few more tokens to get the outcome I want, but the tokens are cheap as dirt and the API is faster.
Perhaps I should plug it into Claude Code and see how it performs? I haven’t tried that.