logoalt Hacker News

wongarsutoday at 12:25 AM1 replyview on HN

But you can't use an old model with a new tokenizer. Changing the tokenizer implies you trained the model from scratch


Replies

dannywtoday at 5:48 AM

A little bit of post-training will fix that. Folks on /r/LocalLLaMa have been making effective finetunes with diff. tokenizers for years.