Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems
Yuan Sun

TL;DR
This paper introduces Safe Bilevel Delegation (SBD), a formal framework enabling dynamic, runtime safety-aware task delegation in hierarchical multi-agent systems, with theoretical guarantees and applications in high-stakes domains.
Contribution
SBD formulates runtime delegation as a bilevel optimization with safety constraints, providing theoretical results and practical instantiations for high-stakes environments.
Findings
Higher safety weights lead to safer inner policies.
Projected gradient descent converges linearly under standard assumptions.
Accountability propagation bounds responsibility across delegation chains.
Abstract
As large language model (LLM) agents are deployed in high-stakes environments, the question of how safely to delegate subtasks to specialized sub-agents becomes critical. Existing work addresses multi-agent architecture selection at design time or provides broad empirical guidelines, but neither provides a runtime mechanism that dynamically adjusts the safety-efficiency trade-off as task context changes during execution. We propose Safe Bilevel Delegation (SBD), a formal framework for runtime delegation safety in hierarchical multi-agent systems. SBD formulates task delegation as a bilevel optimization problem: an outer meta-weight network phi learns context-dependent safety-efficiency weights lambda(s) in [0,1]; an inner loop optimizes the delegation policy pi subject to a probabilistic safety constraint P(safe) >= 1-delta. The continuous delegation degree alpha in [0, 1] controls…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
