Hacker News

flawn yesterday at 11:13 PM

The issue is that you wouldn't even be able to get at any evidence transparently, since these models are black boxes.

They might start scheming behind employees' backs as soon as they realize they are being used in an adversary's critical infrastructure. And nobody would know until it's too late.


Replies

throw-the-towel today at 11:48 AM

Aren't all LLMs just as blackboxey?
