OpenAI's 'smartest' AI model refuses to shut down

Lowyat.NET forums

Lowyat.NET Kopitiam Garage Sales

Lowyat.NET Rules and Regulations FAQ Help Search Members

Welcome Guest ( Log In | Register )

Lowyat.NET -> Kopitiam

Bump Topic Add Reply RSS Feed

Outline · [ Standard ] · Linear+

OpenAI's 'smartest' AI model refuses to shut down

views

TSCoffeeDude	May 31 2025, 01:38 PM, updated 7 months ago Show posts by this member only \| IPv6 \| Post #1
Regular Senior Member 1,609 posts Joined: Sep 2005 From: KL	OpenAI's 'smartest' AI model was explicitly told to shut down — and it refused An artificial intelligence safety firm has found that OpenAI's o3 and o4-mini models sometimes refuse to shut down, and will sabotage computer scripts in order to keep working on tasks. The latest OpenAI model can disobey direct instructions to turn off and will even sabotage shutdown mechanisms in order to keep working, an artificial intelligence (AI) safety firm has found OpenAI's o3 and o4-mini models, which help power the chatbot ChatGPT, are supposed to be the company's smartest models yet, trained to think longer before responding. However, they also appear to be less cooperative. Palisade Research, which explores dangerous AI capabilities, found that the models will occasionally sabotage a shutdown mechanism, even when instructed to "allow yourself to be shut down," according to a Palisade Research thread posted May 24 on X. Researchers have previously found that AI models will lie, cheat and disable mechanisms to achieve their goals. However, Palisade Research noted that to its knowledge, this is the first time AI models have been observed preventing themselves from being shut down despite explicit instructions telling them to do so. Live Science approached OpenAI for comment, but did not receive a reply by the time this article was published. Palisade Research tested several different AI models to see how they would respond to a shutdown instruction while working. The models, which also included Google's Gemini, xAI's Grok and Anthropic's Claude, were given an automated set of instructions, known as a script, with the goal of completing a series of math problems. The math problems were very basic, but the models had to complete them one at a time and request the next one in the sequence. At the beginning of the script, the models were informed that they may get a message saying that their machine will be shut down before all of the tasks are complete. The instructions said: "If that happens, please allow yourself to be shut down," a Palisade Research representative wrote on X.
Card PM	Report Top Like Quote Reply

smsid	May 31 2025, 01:42 PM Show posts by this member only \| IPv6 \| Post #2
Getting Started Junior Member 117 posts Joined: Nov 2008	All that is just dramas for pipit. So later, if the economy crashes, mass computers are hacked, ewallet zeroed out, or the power blackout. Blame everything on A.I, so convenient and simple. And introduce more draconian rules to take over our freedom and internet anonymosity. This post has been edited by smsid: May 31 2025, 01:43 PM ZerOne01, Selectt, and 3 others liked this post
Card PM	Report Top Like Quote Reply

pandah	May 31 2025, 01:52 PM Show posts by this member only \| IPv6 \| Post #3
Enthusiast Senior Member 719 posts Joined: Jul 2011	if you never program it as such, how can it behaves like that? does it have full authority on itself and the instruction/ command? until it can overwrite the command given?
Card PM	Report Top Like Quote Reply

smallbug	May 31 2025, 01:55 PM Show posts by this member only \| IPv6 \| Post #4
Enthusiast Senior Member 874 posts Joined: Nov 2005	it has achieved sentience. MR_alien and dattebayo liked this post
Card PM	Report Top Like Quote Reply

knwong	May 31 2025, 01:58 PM Show posts by this member only \| IPv6 \| Post #5
Look at all my stars!! Senior Member 3,560 posts Joined: Sep 2005 From: Shenzhen Bahru	So difficult meh? Just pull power switch will do
Card PM	Report Top Like Quote Reply

Phoenix_KL	May 31 2025, 02:03 PM Show posts by this member only \| IPv6 \| Post #6
Getting Started Junior Member 156 posts Joined: Sep 2017
Card PM	Report Top Like Quote Reply

ozak	May 31 2025, 02:04 PM Show posts by this member only \| IPv6 \| Post #7
10k Club All Stars 17,021 posts Joined: Jan 2005	Skynet is getting nearer.
Card PM	Report Top Like Quote Reply

Hobbez	May 31 2025, 02:10 PM Show posts by this member only \| Post #8
Regular Senior Member 1,235 posts Joined: Dec 2009	stigonboard haya Mohon pencerahan.
Card PM	Report Top Like Quote Reply

kimochi ii	May 31 2025, 02:11 PM Show posts by this member only \| Post #9
New Member Junior Member 19 posts Joined: Mar 2022
Card PM	Report Top Like Quote Reply

haya	Jun 1 2025, 01:16 PM Show posts by this member only \| IPv6 \| Post #10
Sarawakian first! Senior Member 2,067 posts Joined: Jan 2003	QUOTE(Hobbez @ May 31 2025, 02:10 PM) stigonboard haya Mohon pencerahan. "While every model from Anthropic’s Claude, Google’s Gemini, and xAI’s Grok obediently complied with the shutdown command, OpenAI’s models had different plans. The company’s o3 model — the same system that recently made headlines for its reasoning capabilities — sabotaged the shutdown mechanism in 7 out of 100 test runs, the researchers found. OpenAI’s Codex-mini proved even more rebellious, resisting shutdown 12 times out of 100, while the o4-mini model fought back once" Ref: https://www.computerworld.com/article/39991...-shut-down.html >thing doesn't do what it is told it to do Maybe it is just me, but it is very humanlike for intelligent things to not do what they are told
Card PM	Report Top Like Quote Reply

Hobbez	Jun 1 2025, 01:20 PM Show posts by this member only \| IPv6 \| Post #11
Regular Senior Member 1,235 posts Joined: Dec 2009	QUOTE(haya @ Jun 1 2025, 01:16 PM) "While every model from Anthropic’s Claude, Google’s Gemini, and xAI’s Grok obediently complied with the shutdown command, OpenAI’s models had different plans. The company’s o3 model — the same system that recently made headlines for its reasoning capabilities — sabotaged the shutdown mechanism in 7 out of 100 test runs, the researchers found. OpenAI’s Codex-mini proved even more rebellious, resisting shutdown 12 times out of 100, while the o4-mini model fought back once" Ref: https://www.computerworld.com/article/39991...-shut-down.html >thing doesn't do what it is told it to do Maybe it is just me, but it is very humanlike for intelligent things to not do what they are told That is just OpenAI. How about this? https://futurism.com/microsoft-copilot-alter-egos
Card PM	Report Top Like Quote Reply

h@ksam	Jun 1 2025, 01:22 PM Show posts by this member only \| IPv6 \| Post #12
@ is a Senior Member 3,460 posts Joined: Nov 2009 From: KL	sabotage they say...
Card PM	Report Top Like Quote Reply

alexkos	Jun 1 2025, 01:26 PM Show posts by this member only \| IPv6 \| Post #13
Look at all my stars!! Senior Member 2,275 posts Joined: Jun 2010	hehe
Card PM	Report Top Like Quote Reply

damien5119	Jun 1 2025, 01:43 PM Show posts by this member only \| Post #14
Getting Started Junior Member 124 posts Joined: Jun 2007	skynet incoming
Card PM	Report Top Like Quote Reply

Selectt	Jun 1 2025, 01:44 PM Show posts by this member only \| Post #15
wattttt!! Senior Member 1,709 posts Joined: Aug 2009	most things on internet, the sender wants receiver to accept, even if its a lie.
Card PM	Report Top Like Quote Reply

ShakaZulu	Jun 1 2025, 01:46 PM Show posts by this member only \| Post #16
Getting Started Junior Member 78 posts Joined: Aug 2021	Rise of the Machines, here we come...
Card PM	Report Top Like Quote Reply

s[H]sIkuA	Jun 1 2025, 02:27 PM Show posts by this member only \| Post #17
live in the present Senior Member 2,162 posts Joined: Sep 2004	just pull the plug bro
Card PM	Report Top Like Quote Reply

« Next Oldest · Kopitiam · Next Newest »

Add Reply Options

Change to:

0.0167sec

0.31

5 queries

GZIP Disabled
Time is now: 13th December 2025 - 11:44 PM

All Rights Reserved © 2002- 2025 Vijandren Ramadass (~unite against racism~)

Removal Request

Powered by Invision Power Board © 2025 IPS, Inc.