Learning Safety-Guaranteed, Non-Greedy Control Barrier Functions Using Reinforcement Learning

Minduli Wijayatunga; Nathan Wallace; Salah Sukkarieh; Roberto Armellin

arXiv:2602.00366·math.OC·February 3, 2026

Learning Safety-Guaranteed, Non-Greedy Control Barrier Functions Using Reinforcement Learning

Minduli Wijayatunga, Nathan Wallace, Salah Sukkarieh, Roberto Armellin

PDF

Open Access

TL;DR

This paper introduces a two-stage reinforcement learning framework for spacecraft control that guarantees safety, improves fuel efficiency, and maintains real-time computational complexity by adaptively managing safety constraints.

Contribution

It develops a novel RL-based approach that combines adaptive safety parameters with residual barrier functions to enhance safety and efficiency in safety-critical spacecraft operations.

Findings

01

Reduces median fuel consumption by 12-25% compared to ICCBF baselines.

02

Increases trajectories remaining within safe set S by 7-8%.

03

Maintains real-time quadratic program complexity.

Abstract

Spacecraft rendezvous and proximity operations (RPO) pose safety risks to high-value assets, so formal safety guarantees are critical. Yet conservative safety controllers can reduce mission efficiency. We propose a unified two-stage reinforcement learning (RL) framework that addresses two complementary limitations of Input-Constrained Control Barrier Functions (ICCBFs) for safety-critical, fuel-limited spacecraft control. Given a certified safe set S, ICCBFs guarantee forward invariance of an inner set C* under input bounds, but the resulting per-step quadratic programme (QP) is greedy and fuel-inefficient within C*, and recoverable states outside C* are conservatively discarded. Stage 1 learns state-dependent class-K-infinity parameters that adapt ICCBF/CLF decay rates, embedding long-horizon cost awareness while preserving invariance in C*. Stage 2 learns a residual barrier h_RL(x)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpacecraft Dynamics and Control · Space Satellite Systems and Control · Adaptive Dynamic Programming Control