Hacker News

sheeshkebab, today at 3:40 PM

…but they reason well enough given enough context (using their matmuls).


Replies

noosphr, today at 3:51 PM

To this day, frontier models think that "A and not B" means "A and B" when the sentence gets pushed far enough back in their context window. The context length that a model can reason over without obvious errors is much smaller than the advertised context: somewhere between 1/4 and 1/20 of what is advertised on the tin.
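To put rough numbers on that ratio, here is a minimal sketch. The function name `effective_context_range` and the 200,000-token window are made up for illustration; they are not figures from any vendor or benchmark, and the 1/20 and 1/4 bounds are just the rough estimate above, not a measurement.

```python
def effective_context_range(advertised_tokens: int) -> tuple[int, int]:
    """Return (low, high) token counts for the usable context.

    Assumption: usable context is between 1/20 and 1/4 of the
    advertised window, per the rough estimate in this comment.
    """
    low = advertised_tokens // 20   # pessimistic end: 1/20 of advertised
    high = advertised_tokens // 4   # optimistic end: 1/4 of advertised
    return low, high


# Hypothetical advertised window of 200,000 tokens.
low, high = effective_context_range(200_000)
print(f"usable context: roughly {low:,} to {high:,} tokens")
```

So under that estimate, a window sold as 200k would be dependable for roughly 10k to 50k tokens.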
