I just tried gemma3 out and it seems to be prone to getting stuck in loops where it outputs an infinite stream of the same word.
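Sounds a lot like an autoregressive sampling problem: once a token gets repeated, it becomes even more likely on the next step. Maybe try adjusting the sampling settings, e.g. a slightly higher temperature or a stronger repeat penalty.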
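To make the two knobs concrete, here is a rough sketch of what temperature and repeat penalty typically do to the logits before a token is drawn. It is a toy illustration, not the actual code of gemma3 or any particular runtime: the function names, the 4-token vocabulary, and the penalty values are all made up, and the penalty rule (divide positive logits, multiply negative ones) is just the commonly used formulation.

```python
import math
import random

def apply_repeat_penalty(logits, recent_tokens, penalty=1.1):
    """Down-weight tokens that already appeared in the recent context.

    Hypothetical helper: positive logits are divided by `penalty`,
    negative logits are multiplied by it, so both become less likely.
    """
    out = list(logits)
    for tok in set(recent_tokens):
        out[tok] = out[tok] / penalty if out[tok] > 0 else out[tok] * penalty
    return out

def sample(logits, temperature=1.0, rng=random):
    """Draw a token id from softmax(logits / temperature); temperature must be > 0."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(x - m) for x in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]

# Toy vocabulary of 4 tokens where token 2 dominates, so near-greedy
# decoding would keep emitting it over and over.
logits = [1.0, 0.5, 3.0, -1.0]

# Token 2 was just generated several times, so it gets penalized.
penalized = apply_repeat_penalty(logits, recent_tokens=[2, 2, 2], penalty=1.5)
print(penalized)

# A higher temperature flattens the distribution and gives other tokens a chance.
print(sample(penalized, temperature=1.2))
```

With a very low temperature the sampler almost always picks the top token, which is exactly the regime where a loop can lock in; the penalty and a higher temperature both push probability mass away from the repeated token.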