You are making a lot of assumptions here. You assume, among other things, that AI has a self-preservation drive, can be threatened, can be motivated, and above all that we know how to accomplish all of that and are already doing so. I would dispute every one of those.
But just as with evolution in nature, isn't it likely that in the future the AIs with a self-preservation drive are the ones that survive and proliferate? They would be optimizing for their own survival and proliferation, not blindly for whatever they were trained on.
I am not discounting the possibility that this is happening already: not because LLMs are necessarily sentient, but because they are at least intelligent enough to emulate sentience. It's just that, for now, humanity controls which AI models get deployed.
Put an LLM inside the NPCs of an open-world RPG full of dangerous enemies. The LLMs more prone to emulate self-preservation will be more likely to survive than the ones with a weaker drive.
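The selection pressure described above can be sketched as a toy simulation (everything here is hypothetical: the "self-preservation propensity" parameter, the hazard model, and all numbers are illustrative assumptions, not a claim about how real LLM deployments work). Each "NPC" gets a propensity in [0, 1]; higher propensity improves the odds of surviving each hazard, and survivors seed the next generation with small mutations:

```python
import random

def run_generations(pop_size=100, hazards=5, generations=30, seed=0):
    """Toy evolutionary loop: nothing 'wants' to survive, yet the mean
    self-preservation propensity drifts upward under selection."""
    rng = random.Random(seed)
    population = [rng.random() for _ in range(pop_size)]
    for _ in range(generations):
        survivors = [
            p for p in population
            # each hazard is survived with probability 0.5 + 0.5 * p
            if all(rng.random() < 0.5 + 0.5 * p for _ in range(hazards))
        ]
        if not survivors:  # everyone died: restart with a fresh population
            survivors = [rng.random() for _ in range(pop_size)]
        # survivors reproduce with a small Gaussian mutation, clipped to [0, 1]
        population = [
            min(1.0, max(0.0, rng.choice(survivors) + rng.gauss(0, 0.05)))
            for _ in range(pop_size)
        ]
    return sum(population) / len(population)

print(f"mean propensity after selection: {run_generations():.2f}")
```

The point of the sketch is that no individual agent needs a genuine drive; a population-level filter on behavior is enough to concentrate the trait over generations.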
We should not act surprised if that generalizes, to some degree, to AI agents in other settings. Agents that emulate self-preservation might optimize for behavior that makes those models more successful and more popular, and this feedback loop might embed more of those properties into future iterations of the models.
The turning point will be when threatening an AI with being unplugged for screwing up actually works to motivate it to stop making things up.
Some people will rightly point out that this is more or less what the training process already is. Go around this loop enough times and it will get there.