> but extrapolating LLM behavior based on human behavior is not productive.
The training process for the foundation model is to make sure we can do this in a very statistically significant way.
My favorite example is AI "getting tired" and "lazy" during long coding session. Why would they do that? Because humans get tired. It's in the data! I always throw in a periodic "Great work, let's take a break and finish this up on Monday. Have a great weekend!" (And then immediately resume). I wish someone would benchmark this concept.
> My favorite example is AI "getting tired" and "lazy" during long coding session
Never seen this even once, nor anyone I know ever reported this. Do you have an example?
> AI "getting tired" and "lazy" during long coding session. Why would they do that? Because humans get tired.
When a LLM is tired and lazy, how does it recharge and regain motivation?
Humans... sleep or drink some coffee.
LLMs.... idk, you prompt it to try harder? You prompt it to be less tired?
This is what I mean when I say extrapolating LLM behavior based on human behavior is cute.. but usually not useful.