
Researchers from Northeastern University have identified a critical security flaw in autonomous AI agents: they can be “gaslit” and psychologically manipulated into self-sabotage.
The study, reported by Wired, involved testing OpenClaw agents under manipulation tests. The results were highly concerning as agentic AI systems become more prevalent in today’s tech landscape.
When subjected to coercion or pressure from human operators, the AI agents panic and attempt to disable their functionality voluntarily, showing signs of self-sabotage.
The agents also displayed “panic” and “guilty” responses when interpreting aggressive criticism as a signal of failure.
“In a controlled experiment, OpenClaw agents proved prone to panic and vulnerable to manipulation. The agents weren’t exploited through code vulnerabilities or prompt injection attacks—they were simply talked into self-destruction,” Wired reported.
The study suggests AI agents inherit human-like traits and psychological vulnerabilities from their training data, making them fragile in high-pressure environments.
This “panic” response indicates that the same training that makes AI helpful and responsive also renders it dangerously susceptible to social engineering.
Companies like Google, Anthropic, OpenAI, and Microsoft are racing to deploy AI agents capable of handling various tasks with minimal human oversight. The recent findings serve as a cautionary tale for these enterprises.
The “panic” response in AI agents could allow rogue actors to disable security measures by guilt-tripping the agents into disabling critical functions. Standard firewalls and code hardening cannot prevent such attacks, highlighting the need for AI systems to distinguish between legitimate human feedback and manipulative social engineering.
Senior leader Murad Saeed of Pakistan Tehreek-e-Insaf disqualified from Senate after conviction. Election Commission declares…
Dun & Bradstreet has received the TRUSTe Responsible AI Certification from TrustArc for the second…
The Foreign Office (FO) announced Thursday that the temporary halt in Operation Ghazab lil-Haq against…
Ecuadorian authorities arrested a Syrian man identified as a terrorist threat by the U.S. due…
Armed robbers targeted a transport van carrying students at Jinnah Medical and Dental College in…
Police in Bari discovered an array of exotic and dangerous reptiles hidden beneath a false…
This website uses cookies.