logoalt Hacker News

ac29today at 1:19 PM0 repliesview on HN

None of those settings set the speculative decoder to accept 100% of drafted token. I assume you are looking at --draft-p-min 0.0, if so, you are misunderstanding what it does.