Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems

Yuan Sun

arXiv:2604.27358·cs.AI·May 1, 2026

Safe Bilevel Delegation (SBD): A Formal Framework for Runtime Delegation Safety in Multi-Agent Systems

Yuan Sun

PDF

TL;DR

This paper introduces Safe Bilevel Delegation (SBD), a formal framework enabling dynamic, runtime safety-aware task delegation in hierarchical multi-agent systems, with theoretical guarantees and applications in high-stakes domains.

Contribution

SBD formulates runtime delegation as a bilevel optimization with safety constraints, providing theoretical results and practical instantiations for high-stakes environments.

Findings

01

Higher safety weights lead to safer inner policies.

02

Projected gradient descent converges linearly under standard assumptions.

03

Accountability propagation bounds responsibility across delegation chains.

Abstract

As large language model (LLM) agents are deployed in high-stakes environments, the question of how safely to delegate subtasks to specialized sub-agents becomes critical. Existing work addresses multi-agent architecture selection at design time or provides broad empirical guidelines, but neither provides a runtime mechanism that dynamically adjusts the safety-efficiency trade-off as task context changes during execution. We propose Safe Bilevel Delegation (SBD), a formal framework for runtime delegation safety in hierarchical multi-agent systems. SBD formulates task delegation as a bilevel optimization problem: an outer meta-weight network phi learns context-dependent safety-efficiency weights lambda(s) in [0,1]; an inner loop optimizes the delegation policy pi subject to a probabilistic safety constraint P(safe) >= 1-delta. The continuous delegation degree alpha in [0, 1] controls…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.