Could these quantized models make MTP (Multi-Token Prediction) significantly faster when used as drafters for larger regular Gemma 4 models?
Google already released specialized drafters for Gemma 4.
Google already released specialized drafters for Gemma 4.