Some[Body] Must Receive That Pain for Agent Accountability
Botao Amber Hu, Helena Rong

TL;DR
The paper discusses the importance of consequence reception in AI agents, arguing that without a mechanism for pain or corrective feedback, accountability and future behavior regulation are compromised.
Contribution
It highlights the sociotechnical challenges in designing AI architectures that enable consequence-agency coupling for effective accountability.
Findings
Current AI systems lack mechanisms for consequence reception.
Legal and technical frameworks fail to ensure AI receives behavioral signals.
Proposes that designing AI with consequence-receiving bodies is essential for accountability.
Abstract
AI agents increasingly act consequentially in the real world. This creates a problem we call \emph{consequence reception}: harm occurs, the producing system is identified, yet no continuing agent receives consequences in a way that changes future behavior. Pain, understood mechanistically as a corrective feedback signal, is foundational to canonical theories of punishment -- deterrence, rehabilitation, retribution, and incapacitation all assume a continuing locus that registers the signal and updates behavior. That, in turn, requires a body for the signal to land on: a boundary whose integrity it protects, a locus where it accumulates, consolidation that converts episodic signal into durable update, and a substrate that responds by altering future action. Current LLM agents -- software-defined composites of weights, prompts, tools, memory, and credentials, freely swapped, copied, reset,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
