The issue is that you wouldn't be able to even transparently get to any evidence, as these mode...

flawn • yesterday at 11:13 PM • 1 reply • view on HN

The issue is that you wouldn't be able to even transparently get to any evidence, as these models are blackboxes.

They might start scheming behind employees backs as soon as they realize they are being used in critical infrastructure of adversaries. And nobody would know until it's too late.

Replies

throw-the-towel • today at 11:48 AM

Aren't all LLMs just as blackboxey?

➕ show 1 reply

alt Hacker News

Replies