Meta-Reinforcement Learning for Robust and Non-greedy Control Barrier Functions in Spacecraft Proximity Operations
Minduli C. Wijayatunga, Richard Linares, Roberto Armellin

TL;DR
This paper introduces a meta-reinforcement learning framework to adapt control barrier functions for spacecraft proximity operations, enhancing safety, reducing conservatism, and improving fuel efficiency under uncertainty.
Contribution
It develops a learnable hierarchy of ICCBF parameters and a meta-RL scheme to optimize safety and efficiency in uncertain spacecraft control scenarios.
Findings
Reduces conservatism of ICCBFs in simulations
Improves fuel efficiency in docking scenarios
RNN-based policy outperforms MLP in complex tasks
Abstract
Autonomous spacecraft inspection and docking missions require controllers that can guarantee safety under thrust constraints and uncertainty. Input-constrained control barrier functions (ICCBFs) provide a framework for safety certification under bounded actuation; however, conventional ICCBF formulations can be overly conservative and exhibit limited robustness to uncertainty, leading to high fuel consumption and reduced mission feasibility. This paper proposes a framework in which the full hierarchy of class- functions defining the ICCBF recursion is parameterized and learned, enabling localized shaping of the safe set and reduced conservatism. A control margin is computed efficiently using differential algebra to enable the learned continuous-time ICCBFs to be implemented on time-sampled dynamical systems typical of spacecraft proximity operations. A meta-reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Spacecraft Dynamics and Control · Space Satellite Systems and Control
