I love to hate it when someone unironically thinks asking an llm how many letters are in a word is a...

halJordan • yesterday at 10:14 PM • 1 reply • view on HN

I love to hate it when someone unironically thinks asking an llm how many letters are in a word is a good test

Replies

It is a good test now, for reasoning models.

It was a terrible test for pure tokenized models, because the logit that carries the carry digit during summation has a decent chance at getting lost.

SOTA models should reason to generate a function that returns the count of a given character, evaluate the function with tests, and use it for the output.

alt Hacker News

Replies