This model is small enough that it might be sensible to try the same prompts against all of the quant sizes to try and spot any differences.
This inspired me to give that a go: https://simonw.github.io/granite-4.1-3b-gguf-pelicans/
This inspired me to give that a go: https://simonw.github.io/granite-4.1-3b-gguf-pelicans/