Absolutely. I use caveman to help with that: https://github.com/JuliusBrussee/caveman
Not a bad idea. However:
> Caveman only affects output tokens — thinking/reasoning tokens are untouched.
The problem is the thinking. But it could still help me tune my system prompt for Kimi.
You can just add "be brief" to the prompt to replace the entire plugin. Same results.
https://www.maxtaylor.me/articles/i-benchmarked-caveman-agai...