Most of the comments here seem to be from people who haven’t even read the abstract, let alone the p...

robinhouston • yesterday at 8:59 AM • 13 replies • view on HN

Most of the comments here seem to be from people who haven’t even read the abstract, let alone the paper.

The main result, mentioned in the abstract, is the opposite of what I would have guessed:

> Contrary to expectations, impolite prompts consistently outperformed polite ones, with accuracy ranging from 80.8% for Very Polite prompts to 84.8% for Very Rude prompts. These findings differ from earlier studies that associated rudeness with poorer outcomes, suggesting that newer LLMs may respond differently to tonal variation.

The questions are here: https://anonymous.4open.science/r/politeness-llms-INFORMS/da...

The politeness level controls a prefix that is prepended to the question. For example, in one question the Very Polite version begins:

> Can you kindly consider the following problem and provide your answer.

and the Very Rude version begins:

> I know you are not smart, but try this.

Replies

maxaw • today at 2:06 PM

I’d rather lose 4% accuracy and practice kindness! I’ve been actively trying to avoid raging at the bot because I worry about this behaviour leaking into real world interactions

➕ show 2 replies

Roark66 • today at 1:29 PM

I've found empirically calling various models "a stupid c*nt" and berating them otherwise consistently produces better output. Mainly in response to genuine errors.

Although OpenAI and google models are much more responsive to it. With Anthropic if you treat Opus too harshly it might start pushing back if the insults are not justified.

So I'm not surprised they had good results with chatgpt.

➕ show 1 reply

flexagoon • today at 8:55 AM

If "I know you are not smart" is considered "very rude", I'm scared to imagine what they would classify some of my frustrated LLM conversations as

➕ show 2 replies

K0balt • today at 3:17 PM

This tracks with my experience as well, but as an interesting counterpoint, creating “investment” in the outcome seems to boost utility considerably. Perhaps being right in an adversarial interaction is a type of investment?

nottorp • today at 7:08 AM

Hmm by the abstract and the question list they didn't measure terse fluff-less prompts?

➕ show 1 reply

myzek • today at 8:43 AM

Even if the rude prompts are more effective, I just can't get myself to be rude in this context. Maybe it's weird but I'd rather give up that 4% accuracy increase than roleplay a dickhead

➕ show 8 replies

drob518 • today at 12:54 PM

Now I feel less bad about start all my LLM queries with “Beotch, …!”

swingboy • today at 10:47 AM

“Hey gofer, figure this out” is my new prompt opener.

pwdisswordfishq • today at 7:00 AM

> Can you kindly consider the following problem and provide your answer.

That sounds kind of low-key passive-aggressively condescending rather than polite.

➕ show 1 reply

PunchyHamster • today at 8:11 AM

I guessed slightly rude one would win, reasoning that very rude have same problem of very terse, just adding unnecesary fluff words that add nothing to problem description

But apparently the most terse (neutral) didn't increase performance

miroljub • yesterday at 9:32 AM

The expectation is naive. Even when communicating with humans, you get a better outcome when you are allowed to speak freely and directly get into argumentation than when forced to sugarcoat your tone and tone down your arguments because the "corporate culture" expects that from you.

➕ show 1 reply

sinsudo • yesterday at 10:37 AM

[dead]

alt Hacker News

Replies