Learning Safety-Guaranteed, Non-Greedy Control Barrier Functions Using Reinforcement Learning
Minduli Wijayatunga, Nathan Wallace, Salah Sukkarieh, Roberto Armellin

TL;DR
This paper introduces a two-stage reinforcement learning framework for spacecraft control that guarantees safety, improves fuel efficiency, and maintains real-time computational complexity by adaptively managing safety constraints.
Contribution
It develops a novel RL-based approach that combines adaptive safety parameters with residual barrier functions to enhance safety and efficiency in safety-critical spacecraft operations.
Findings
Reduces median fuel consumption by 12-25% compared to ICCBF baselines.
Increases trajectories remaining within safe set S by 7-8%.
Maintains real-time quadratic program complexity.
Abstract
Spacecraft rendezvous and proximity operations (RPO) pose safety risks to high-value assets, so formal safety guarantees are critical. Yet conservative safety controllers can reduce mission efficiency. We propose a unified two-stage reinforcement learning (RL) framework that addresses two complementary limitations of Input-Constrained Control Barrier Functions (ICCBFs) for safety-critical, fuel-limited spacecraft control. Given a certified safe set S, ICCBFs guarantee forward invariance of an inner set C* under input bounds, but the resulting per-step quadratic programme (QP) is greedy and fuel-inefficient within C*, and recoverable states outside C* are conservatively discarded. Stage 1 learns state-dependent class-K-infinity parameters that adapt ICCBF/CLF decay rates, embedding long-horizon cost awareness while preserving invariance in C*. Stage 2 learns a residual barrier h_RL(x)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpacecraft Dynamics and Control · Space Satellite Systems and Control · Adaptive Dynamic Programming Control
