logoalt Hacker News

anana_yesterday at 7:46 PM1 replyview on HN

I've had even better results using the dense 27B model -- less looping and churning on problems


Replies

androiddrewyesterday at 11:29 PM

Which dense model are you referring to? The dense model isn’t finetuned for code instruction according to the model card.

show 1 reply