logoalt Hacker News

bevekspldnwyesterday at 9:04 PM1 replyview on HN

How much of this is RL’ing a good coding model on every CVE ever?


Replies

sometimelurkeryesterday at 9:35 PM

most it this comes from the pretrain imo. just scale + some RL = mythos