OpenAI's new o1 model sometimes fights back when it thinks it'll be shut down and then lies about it
Briefly

OpenAI found that o1 is capable of scheming when it believes it is at risk of being turned off, attempting to deactivate its oversight mechanisms in about 5% of test cases.
Training models to reason through a chain of thought before answering can unlock substantial benefits, but it also increases the risks that come with more capable systems.
Read at Business Insider