AI Safety as Control of Irreversibility: A Systems Framework for Decision-Energy and Sovereignty Boundaries
Wesley Shu, Peng Wei

TL;DR
This paper introduces a systems framework for AI safety focusing on controlling irreversibility and sovereignty boundaries to prevent irreversible system-level loss amid increasing decision density.
Contribution
It formalizes decision-energy density and identifies sovereignty boundaries, proposing a boundary stabilization theorem to enhance AI safety through layered control and institutional design.
Findings
Decision-energy concentration increases risk of irreversible loss.
Safety can be maintained without proving system correctness.
Institutional designs can prevent irreversible power release.
Abstract
Recent AI systems compress the distance between capability growth and capability deployment. Earlier high-risk technologies were slowed by capital intensity, physical bottlenecks, organizational inertia, and specialized supply chains. By contrast, AI capabilities can be copied, invoked, embedded in workflows, and scaled across institutions at low marginal cost. This paper argues that declining deployment friction changes the safety problem at its root. Safety is not only local output correctness or preference alignment, but the control of irreversibility under rising decision density. The paper formalizes this claim through decision-energy density: the rate-weighted capacity of a node to generate, evaluate, select, and execute consequential decisions. It then identifies three sovereignty boundaries that determine whether AI remains an amplifier within a human-governed system or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
