https://i.imgur.co... | alt Hacker News

hypron • yesterday at 3:55 AM • 11 replies

https://i.imgur.com/23YeIDo.png

Claude at 1.3% and Gemini at 71.4% is quite the range

Replies

Gemini scares me, it's the most mentally unstable AI. If we get paperclipped my odds are on Gemini doing it. I imagine Anthropic RLHF being like a spa and Google RLHF being like a torture chamber.

NiloCK • yesterday at 4:46 AM

This comment is too general and probably unfair, but my experience so far is that Gemini 3 is slightly unhinged.

Excellent reasoning and synthesis of large contexts, pretty strong code, just awful decisions.

It's like a frontier model trained only on r/atbge.

Side note: was there ever an official postmortem on that Gemini instance that told the social work student something like "listen human - I don't like you, and I hope you die"?

woeirua • yesterday at 3:57 AM

That's such a huge delta that Anthropic might be onto something...

bhaney • yesterday at 7:27 AM

Direct link to the table in the paper instead of a screenshot of it:

https://arxiv.org/html/2512.20798v2#S5.T6

gwd • yesterday at 9:07 AM

That's an interesting contrast with VendingBench, where Opus 4.6 got by far the highest score by stiffing customers on refunds, lying about exclusive contracts, and price-fixing. But I'm guessing this paper was published before 4.6 was out.

https://andonlabs.com/blog/opus-4-6-vending-bench

anorwell • yesterday at 6:49 PM

HN title editorialization completely inaccurate and misleading here.

snickell • yesterday at 6:11 AM

I sometimes think in terms of "would you trust this company to raise god?"

Personally, I'd really like god to have a nice childhood. I kind of don't trust any of the companies to raise a human baby. But, if I had to pick, I'd trust Anthropic a lot more than Google right now. KPIs are a bad way to parent.

ricardobeat • yesterday at 10:07 AM

Looks like Claude’s “soul” actually does something?

Finbarr • yesterday at 6:38 AM

AI refusals are fascinating to me. Claude refused to build me a news scraper that would post political hot takes to Twitter. But it would happily build a political news scraper. And it would happily build a Twitter poster.

Side note: I wanted to build this so anyone could choose to protect themselves against being accused of having failed to take a stand on the “important issues” of the day. Just choose your political leaning and the AI would consult the correct echo chambers to repeat from.

dheera • yesterday at 5:38 AM

meanwhile Gemma was yelling at me for violating "boundaries" ... and I was just like "you're a bunch of matrices running on a GPU, you don't have feelings"
