Hacker News

agenticup today at 6:13 AM

qwen 3.6 27b and qwen 35b a3b work like magic. If we get dpark speculative decoding versions of these models, throughput will improve further.