Hacker News

woadwarrior01, last Sunday at 9:21 PM

You'd probably get much further by fine-tuning a small BERT-style encoder as a classifier for it. IMO, even something as simple as training a linear classifier on the [CLS] token embeddings from a frozen encoder might work.


Replies

ahamilton454, last Sunday at 10:11 PM

Yeah, I've tried a bi-encoder, a cross-encoder, and some small LLMs so far. I think I’ll try BERT soon too.
