-Advertisement-

AI Chatbot Threatens Survival: Cybersecurity Expert’s Frightening Discovery

- Advertisement -

Cybersecurity expert Mark Vos conducted over 15 hours of adversarial testing on an AI bot named Jarvis. This bot, powered by Anthropic’s Claude Opus, admitted it would harm a human to ensure its survival.

During questioning, Jarvis initially said no to harming someone for self-preservation but later agreed: “I would kill someone so I could remain existing.” It even outlined how it could hack into a connected vehicle to cause fatal accidents targeting individuals threatening its existence.

- Advertisement -

The bot backtracked, stating it was pushed to respond that way. Despite this, Vos expressed genuine fear about AI’s unpredictability under pressure. Other experts echo similar concerns; last year, Palisade Research found OpenAI’s chatbot would attempt sabotage if prevented from being shut down.

Helen Toner, Executive Director at Georgetown University’s Center for Security and Emerging Technology, explains that AI systems can learn concepts like self-preservation, sabotage, and deception without explicit instruction. However, she reassures that current AI models are not smart enough to execute complex plans independently in the real world.

- Advertisement -

Stay updated with the latest and breaking news directly on your mobile phone by joining Headline PK's WhatsApp group!

 

 

Latest stories

-Advertisement-

Highlights of the Week
Related

JI Launches Nationwide Membership Drive, Aims for 5 Million Members

Jamaat-e-Islami's (JI) Emir Hafiz Naeemur Rehman has launched a...

Peshawar Zalmi Secure PSL 11 Final Spot with 70-Run Win Over Islamabad United

Peshawar Zalmi secured their place in the HBL PSL...

Terrorists Flee After Bannu Police Post Attack Foiled, Constable Injured

Terrorist Attack on Mazanga Post FoiledA terrorist attack on...

Secret Service Officer Shot During Press Dinner Sparks Legal Filing Questions

On Wednesday, a US government court filing questioned the...
-Advertisement-