Governed Capability Evolution: Lifecycle-Time Compatibility Checking and Rollback for AI-Component-Based Systems, with Embodied Agents as Case Study
Xue Qin, Simin Luan, John See, Cong Yang, Zhijun Li

TL;DR
This paper introduces a staged framework for safe AI capability upgrades in embodied agents, ensuring compatibility and rollback, demonstrated through a robotics testbed with improved safety and success rates.
Contribution
It formulates capability evolution as a software lifecycle problem and proposes a staged validation framework with four compatibility checks for safe upgrades.
Findings
Naive upgrades reach 72.9% success but 60% unsafe activations.
Governed upgrades maintain 67.4% success with zero unsafe activations.
Rollback succeeds in 79.8% of drift scenarios.
Abstract
Software systems built from versioned AI components increasingly need lifecycle-time governance: when a capability module evolves into a new version, the hosting system must decide whetmeher the new version may be activated safely, under what deployment conditions, with what monitoring, and when it should be rolled back. Existing software-deployment patterns (canary, blue-green, feature flags, MLOps pipelines) address parts of this loop but were designed for stateless web services rather than stateful, policy-constrained runtimes that drive AI components in the field. We study this problem in the setting of embodied agents, where capabilities are packaged as installable modules under runtime policy and recovery constraints. We formulate governed capability evolution as a first-class software-lifecycle problem for AI-component-based systems and propose a staged upgrade framework that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
