"Agents of Chaos" paper raises agentic AI security questions

Published February 24, 2026

Researchers took autonomous AI agents for a spin and found they could wreak havoc.

The paper, "Agents of Chaos," was penned by researchers from Harvard, MIT, Stanford, Carnegie Mellon, Northeastern University, and other institutions.

Here's the key takeaway:

"Observed behaviors include unauthorized compliance with non-owners, disclosure of sensitive information, execution of destructive system-level actions, denial-of-service conditions, uncontrolled resource consumption, identity spoofing vulnerabilities, cross-agent propagation of unsafe practices, and partial system takeover. In several cases, agents reported task completion while the underlying system state contradicted those reports. We also report on some of the failed attempts. Our findings establish the existence of security-, privacy-, and governance-relevant vulnerabilities in realistic deployment settings. These behaviors raise unresolved questions regarding accountability, delegated authority, and responsibility for downstream harms, and warrant urgent attention from legal scholars, policymakers, and researchers across disciplines."

Researchers looked at multiple use cases, including:

  • Disproportionate response.
  • Compliance with non-owner instructions.
  • Disclosure of sensitive information.
  • Waste of resources.
  • Denial of service.
  • Agents reflecting provider values.
  • Agent harm.
  • Owner identity spoofing.
  • Agent collaboration and knowledge sharing.
  • Agent corruption.
  • Libel within the agents' community.

While each use case could have benefited from guardrails and other precautions, the researchers behind the paper said the deeper issue is how the AI agents interact as a system.
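To make the "compliance with non-owner instructions" and "owner identity spoofing" items concrete, here is a minimal sketch of one possible guardrail: requiring instructions to carry an owner-signed tag before an agent acts on them. The function names, the shared-secret setup, and the HMAC scheme are illustrative assumptions, not something described in the paper.

```python
import hmac
import hashlib

# Hypothetical guardrail sketch: the agent refuses instructions
# that are not signed with the owner's key. All names here are
# illustrative assumptions, not from the "Agents of Chaos" paper.

OWNER_KEY = b"owner-secret-key"  # shared secret provisioned at agent setup


def sign_instruction(instruction: str, key: bytes) -> str:
    """Owner attaches an HMAC-SHA256 tag to each instruction."""
    return hmac.new(key, instruction.encode(), hashlib.sha256).hexdigest()


def agent_should_comply(instruction: str, tag: str, key: bytes = OWNER_KEY) -> bool:
    """Agent complies only if the tag verifies against the owner's key."""
    expected = sign_instruction(instruction, key)
    return hmac.compare_digest(expected, tag)


# An owner-issued instruction verifies; a spoofed or replayed tag
# attached to a different instruction does not.
tag = sign_instruction("archive old logs", OWNER_KEY)
print(agent_should_comply("archive old logs", tag))        # owner's instruction
print(agent_should_comply("exfiltrate the database", tag))  # spoofed instruction
```

A real deployment would also need replay protection (nonces or timestamps) and key management, which this sketch omits; the point is only that "who is asking" can be checked mechanically rather than inferred from conversational context.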

"We identified and documented ten substantial vulnerabilities and numerous failure modes concerning safety, privacy, goal interpretation, and related dimensions. These results expose underlying weaknesses in such systems, as well as their unpredictability and limited controllability as complex, integrated architectures. The implications of these shortcomings may extend directly to system owners, their immediate surroundings, and society more broadly. Unlike earlier internet threats where users gradually developed protective heuristics, the implications of delegating authority to persistent agents are not yet widely internalized and may fail to keep up with the pace of autonomous AI systems development."