Hacker News

sigmoid10, last Thursday at 6:29 PM

Sounds a lot like an autoregressive sampling problem. Maybe try to set temperature and repeat penalty differently.
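For anyone wondering what those two knobs actually do, here is a rough sketch of one common formulation. It is not the code of any particular runtime. The function names, the default values, and the "divide positive logits, multiply negative ones" penalty rule are assumptions for illustration; real implementations differ in the details.

```python
import math
import random

def token_probs(logits, recent_tokens, temperature=0.8, repeat_penalty=1.1):
    """Turn raw logits into sampling probabilities.

    Hypothetical helper: applies a repeat penalty to recently emitted
    token ids, then temperature scaling, then a softmax.
    """
    adjusted = list(logits)
    for tok in set(recent_tokens):
        # Push already-seen tokens down: shrink positive logits,
        # make negative logits more negative.
        if adjusted[tok] > 0:
            adjusted[tok] /= repeat_penalty
        else:
            adjusted[tok] *= repeat_penalty

    if temperature == 0:
        # Temperature 0 is treated as greedy decoding: all probability
        # mass goes to the single highest-scoring token.
        best = max(range(len(adjusted)), key=adjusted.__getitem__)
        return [1.0 if i == best else 0.0 for i in range(len(adjusted))]

    # Higher temperature flattens the distribution, lower sharpens it.
    scaled = [x / temperature for x in adjusted]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def sample_token(probs, rng=random):
    """Draw one token id according to the given probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

With temperature 0 the same context always yields the same next token, so once the model enters a loop nothing breaks it out. A nonzero temperature plus a penalty above 1 gives other tokens a chance.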


Replies

ghc, last Thursday at 11:42 PM

You're right, I should have checked the model settings. For some reason the default model profile in Ollama had temperature set to 0. Changing the temperature and repeat penalty worked much better than it did when I tried to correct similar behavior in the smallest phi4 reasoning model.
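In case it helps someone else, this is roughly how the two settings can be overridden per request through Ollama's local HTTP API, using the `options` field of `/api/generate`. Treat it as a sketch: the helper names are made up, the model name is whatever you pass in, and the 0.7 and 1.1 defaults are illustrative values, not the ones used above.

```python
import json
import urllib.request

# Ollama's default local endpoint; adjust if your server listens elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_request(model, prompt, temperature=0.7, repeat_penalty=1.1):
    """Build (but do not send) a generate request with sampling overrides."""
    body = {
        "model": model,
        "prompt": prompt,
        "stream": False,  # ask for a single JSON response instead of a stream
        "options": {
            "temperature": temperature,        # illustrative value
            "repeat_penalty": repeat_penalty,  # illustrative value
        },
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def generate(model, prompt, **overrides):
    """Send the request to a running Ollama server and return the text."""
    with urllib.request.urlopen(build_request(model, prompt, **overrides)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("<your-model>", "some prompt", temperature=0.7)` needs a running Ollama server with that model pulled. The same two parameters can also be set persistently in a Modelfile with `PARAMETER temperature ...` and `PARAMETER repeat_penalty ...`.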
