logoalt Hacker News

jmyeyesterday at 1:26 PM1 replyview on HN

Genuine questions: what do you think the request was? To build the plan? To prepare the commit? Do you never have a second thought after looking at your output, or realize you forgot something you wanted to include? Could it be that they saw "one new function", thought "boy, there should really be two... what happened?" and changed their mind?

To your original comment, it would be like calling your intern to ask them to order lunch, and them letting you know the sandwich place you asked them to order from was closed, and should they just put in an order for next Tuesday at an entirely different restaurant instead? And then that intern hearing, "no, that's not what I want" saying "well, I don't respect your 'no'" and doing it anyways.

"Do X" -> "Here are the anticipated actions (which might deviate from your explicit intent), should I implement?" -> "no, that's not actually what I want"

is a clear instruction set and a completely normal thought pattern.


Replies

vova_hn2yesterday at 2:02 PM

My point is that I don't see any scenario in which sending a ton of input tokens (the whole past conversation) and expecting output that literally does nothing is absolutely pointless.

Like, sure, if the model were "smarter" it would probably generate something like "Okay, I won't do it.". What is the value of the response "Okay, I won't do it."? Why did you just waisted time and compute to generate it?

> Do you never have a second thought after looking at your output, or realize you forgot something you wanted to include? Could it be that they saw "one new function", thought "boy, there should really be two... what happened?" and changed their mind?

Sure, all of those are totally valid. And in each of this cases it would be better to just don't make the request at all or make the request with the correction.

> like calling your intern

LLM is not human. It can't act on it's own without you triggering it to act. Unlike the intern in your example who will be wasting time and getting frustrated if they don't receive a response from you. With model you can just abandon this "conversation" (which is really just a growing context that you send again and again with every request) forever or until you are ready to continue it. There is no situation when just adding "no" to the conversation is useful.