
paradox460, today at 12:57 AM

In my experience it spends a lot more tokens to do things. I wrote a tiny extension for omp that counts the occurrences of "Actually" in the response and, if the count exceeds a threshold, stops execution and waits for me to tell it what to do. Even then, it frequently just ignores basic instructions like "only write boilerplate, I will fill in the functionality".
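For anyone curious, the counting part is roughly the sketch below. This is only the core check in plain Python, not the actual extension: the function names and the threshold value are made up for illustration, and how it hooks into omp is left out entirely.

```python
import re

# Hypothetical threshold; pick whatever matches your tolerance.
ACTUALLY_THRESHOLD = 3


def count_actually(response: str) -> int:
    """Count whole-word, case-insensitive occurrences of "actually"."""
    return len(re.findall(r"\bactually\b", response, flags=re.IGNORECASE))


def should_pause(response: str, threshold: int = ACTUALLY_THRESHOLD) -> bool:
    """Return True when the count exceeds the threshold, meaning stop and wait for the user."""
    return count_actually(response) > threshold
```

The word-boundary match keeps words like "factually" from being counted. The caller would check `should_pause(...)` on each response and halt the run when it returns True.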

IMO MiniMax and MiMo are a lot more reliable (and cheap).

Not Opus level, but close enough and cheap enough to get the job done.