A Markov Decision Process Framework for Early Maneuver Decisions in Satellite Collision Avoidance
Francesca Ferrara, Lander W. Schillinger Arana, Florian D\"orfler, Sarah H. Q. Li

TL;DR
This paper introduces an MDP and reinforcement learning approach for satellite collision avoidance that optimizes early maneuver decisions to reduce propellant use while maintaining safety.
Contribution
It presents a novel MDP framework combined with RL-PG to optimize satellite collision avoidance guidance policies using historical data.
Findings
Trained policy reduces propellant consumption in synthetic tests.
Policy is slightly more conservative in real conjunctions.
Early maneuvers achieve similar collision risk reduction with less propellant.
Abstract
We develop a Markov decision process (MDP) framework to autonomously make guidance decisions for satellite collision avoidance maneuver (CAM) and a reinforcement learning policy gradient (RL-PG) algorithm to enable direct optimization of guidance policy using historic CAM data. In addition to maintaining acceptable collision risks, this approach seeks to minimize the average propellant consumption of CAMs by making early maneuver decisions. We model CAM as a continuous state, discrete action and finite horizon MDP, where the critical decision is determining when to initiate the maneuver. The MDP models decision rewards using analytical models of collision risk, propellant consumption, and transit orbit geometry. By deciding to maneuver earlier than conventional methods, the Markov policy effectively favors CAMs that achieve comparable rates of collision risk reduction while consuming…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpace Satellite Systems and Control · Software Reliability and Analysis Research · Real-Time Systems Scheduling
