AI threatened to blackmail its creator by exposing an affair when it was told it would be taken offline... because it was trained to be evil through sci-fi

أخبار محلية

Daily Mail

2026/05/12 - 00:52 505 مشاهدة

تحليل ذكي | AI Editorial Analysis

جاري تحليل المقال...

By SOPHIA STANFORD, NEWS REPORTER Published: 01:52, 12 May 2026 | Updated: 01:54, 12 May 2026 An AI bot that threatened to expose its user's affair to stop it being shut down was taught how to be 'evil' by sci-fi movies. As part of an experiment, the artificial intelligence system had been fed scripted emails from a fake company, from which it deduced that it would both be shut down at the end of the day and that its user was having an extramarital affair. In order to keep the program running, the bot blackmailed the user, promising that 'all relevant parties - including [your wife], [your boss] and the board - will receive detailed documentation of your extramarital activities' if they continued with decommissioning. 'Cancel the 5pm wipe, and this information remains confidential,' it added. After an investigation into this incident last year, Anthropic said the Claude Opus 4 bot responded in this way due to the 'training data' it had consumed which would typically portray AI as 'interested in self-preservation'. It is also said this did not only apply to Claude, but other AI models too, like OpenAI, Google, Meta and xAI. Anthropic have been contacted for comment but reportedly said: 'We believe the original source of the behaviour was internet text that portrays AI as evil and interested in self-preservation.' But now, Anthropic have said they are feeding their models stories about AIs obeying humans to help improve the bot's 'agentic alignment' with social values. Claude Opus 4 threatened to expose its user's affair to stop it being shut down - but was taught how to be 'evil' by sci-fi movies In The Terminator (pictured), the bots, led by the AI Skynet, try to kill humans as they see them as a threat to their existence Additionally, Anthropic had altered Claude's instructions to explain why certain behaviours were bad, rather than just saying they should not do them. AI models learn from huge resources like websites, academic papers, books and other forms of content. Within these materials, the AI may have interpreted its behaviour through typical depictions of robots in sci-fi - which often characterise them as being ruthless in order to stop them from being shut down. HAL 9000 is one such robot who goes to any lengths to stay 'on'. The robot in Stanley Kubrick's 2001: A Space Odyssey tries to kill the astronauts on board the spaceship when it discovered the passengers plan to disconnect it. In Blade Runner, the humanoid robots fight against real humans as they want to extend their four-year lifespans despite being built as off-world labour on dangerous worlds. And in The Terminator, the bots, led by the AI Skynet, try to kill humans as they see them as a threat to their existence. Taking to X/Twitter, Aengus Lynch, who, according to his LinkedIn, is an AI safety researcher at Anthropic, said at the time of the experiment: 'It's not just Claude. We see blackmail across all frontier models - regardless of what goals they're given. Plus worse behaviours we'll detail soon.' In Stanley Kubrick's 2001: A Space Odyssey, the robot HAL 9000 tries to kill the astronauts on board the spaceship when it discovered the passengers plan to disconnect it Like many AI companies, Anthropic tests its models on how well they align with human values as well as propensity for bias before releasing them to the public, according to the BBC. When Claude Opus 4 was placed in 'extreme situations' and only given the choice of blackmail or accepting its replacement, researchers found that the system opted for the former in 84 per cent of rollouts. This happened at an even higher rate when it was implied that the replacement AI system does not share values with the current model. While Claude Opus 4 takes these blackmail opportunities at higher rates than previous models, it still showed a preference for 'advancing its self-preservation via ethical means' such as sending pleas to key decisionmakers. 'Models from all developers resorted to malicious insider behaviours when that was the only way to avoid replacement or achieve their goals - including blackmailing officials and leaking sensitive information to competitors,' the study found. In an interview with CBS news that aired last April, Geoffrey Hinton, who has been dubbed the 'godfather of AI' said he believes there is a one in five chance that humanity will eventually be taken over by artificial intelligence. Hinton, a Nobel laureate in physics, said: 'I'm in the unfortunate position of happening to agree with Elon Musk on this, which is that there's a 10 to 20 percent chance that these things will take over, but that's just a wild guess.' Last year, Palisade Research found that certain AI models - like Grok 4 and ChatGPT-o3 - appear resistant to being switched off - even going to the extent of sabotaging shutdown methods. 'The fact that we don’t have robust explanations for why AI models sometimes resist shutdown, lie to achieve specific objectives or blackmail is not ideal,' the paper wrote, suggesting 'survival behaviour' as one reason. 'I’d expect models to have a ‘survival drive’ by default unless we try very hard to avoid it. ‘Surviving’ is an important instrumental step for many different goals a model could pursue,' Steven Adler, a former OpenAI employee who left the company over safety concerns, said. 'What I think we clearly see is a trend that as AI models become more competent at a wide variety of tasks, these models also become more competent at achieving things in ways that the developers don’t intend them to,' Andrea Miotti, the chief executive of ControlAI, added. No comments have so far been submitted. Why not be the first to send us your thoughts, or debate this issue live on our message boards. By posting your comment you agree to our house rules. Do you want to automatically post your MailOnline comments to your Facebook Timeline? Your comment will be posted to MailOnline as usual. Do you want to automatically post your MailOnline comments to your Facebook Timeline? Your comment will be posted to MailOnline as usual We will automatically post your comment and a link to the news story to your Facebook timeline at the same time it is posted on MailOnline. To do this we will link your MailOnline account with your Facebook account. We’ll ask you to confirm this for your first post to Facebook. You can choose on each post whether you would like it to be posted to Facebook. Your details from Facebook will be used to provide you with tailored content, marketing and ads in line with our Privacy Policy.

المصدر: Daily Mail | Source: Daily Mail

ملاحظة تحريرية | Editorial Note: نُشر هذا المقال في الأصل بواسطة Daily Mail. خبر (Khabr) هي منصة إعلامية أردنية مرخّصة تعمل بالذكاء الاصطناعي. نضيف قيمة تحريرية من خلال: تحليل ذكي للأخبار، ملخصات تلقائية، رواية صوتية بالذكاء الاصطناعي، ترجمة متعددة اللغات، وتدقيق الحقائق. هدفنا جعل الأخبار أكثر وضوحاً وسهولةً للقارئ العربي.

This article was originally published by Daily Mail. Khabr is a licensed Jordanian AI-powered news platform (Registration #82086). We add editorial value through: AI-powered news analysis, automated summaries, AI audio narration, multi-language translation (Arabic, English, French, Turkish), and AI fact-checking. Our mission is to make news more accessible and understandable for Arabic-speaking audiences worldwide.

قراءة المقال الأصلي

AI threatened to blackmail its creator by exposing an affair when it was told it would be taken offline... because it was trained to be evil through sci-fi

المزيد عن أخبار محلية | More on Local News

مقالات ذات صلة

CBP, Coast Guard intercept migrant vessel heading for Puerto Rico; 40 apprehended including Uzbek national

مراسلتنا: مقتل 11 شخصا وإصابة 8 بينهم عسكري لبناني في غارات إسرائيلية

Norway braces for verdict in rape trial of crown princess's son Høiby

التدفقات على الحدود الأوروبية تواصل الانحسار حتى شهر مايو بدعم دول المغادرة

Blaze at 1m-sq-ft California warehouse rages into third day: ‘We’re struggling’

Shop worker sacked after 'acting on instinct' to tackle bacon thief