Has the model really improved? I tried several tasks today, and most of them failed, even though they were super easy ones.
Maybe it's just because gpt5.2 in Cursor is super stupid?