Is this [0] saying that Unsloth's versions of Google's QAT models are better than Google's own QAT models? Or am I misunderstanding it?
[0] https://unsloth.ai/docs/models/gemma-4/qat#qat-analysis
It's saying it's better than naively truncating the QAT release to 4 bits.
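In case "naively truncating" is unclear, here is a rough sketch of the general idea. It assumes the phrase means plain round-to-nearest 4-bit quantization with one absmax scale per group of weights. The function names, the group size of 64, and the scale search are all hypothetical and only there for illustration; this is not Unsloth's or Google's actual procedure, which I haven't checked.

```python
import random


def quantize_rtn(weights, scale):
    """Round each weight to the nearest signed 4-bit level in [-8, 7],
    then map it back to a float so the error can be measured."""
    out = []
    for w in weights:
        q = max(-8, min(7, round(w / scale)))
        out.append(q * scale)
    return out


def mse(a, b):
    """Mean squared error between two equal-length lists."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def naive_scale(weights):
    # Naive choice: map the largest magnitude in the group onto level 7.
    return max(abs(w) for w in weights) / 7


def searched_scale(weights, steps=50):
    # Hypothetical stand-in for a more careful conversion: try scales from
    # 50% to 100% of the naive one and keep whichever gives the lowest error.
    base = naive_scale(weights)
    candidates = [base * (0.5 + 0.5 * i / steps) for i in range(steps + 1)]
    return min(candidates, key=lambda s: mse(weights, quantize_rtn(weights, s)))


random.seed(0)
group = [random.gauss(0.0, 1.0) for _ in range(64)]  # made-up weights
group[0] = 6.0  # one outlier stretches the naive scale

naive_err = mse(group, quantize_rtn(group, naive_scale(group)))
searched_err = mse(group, quantize_rtn(group, searched_scale(group)))

print(f"naive round-to-nearest MSE: {naive_err:.5f}")
print(f"searched-scale MSE:         {searched_err:.5f}")
```

Both results use the same 4 bits per weight. The only difference is how the conversion is done, so the searched scale can never be worse than the naive one here, since the naive scale is one of the candidates.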