logoalt Hacker News

girvoyesterday at 10:50 PM0 repliesview on HN

Being grafted onto the main model reduces layer duplication that you’d otherwise have: at least for Step and Qwen 3.6