Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement Learning
Carlos M. Casas, Belen Carro, and Antonio Sanchez-Esguevillas

TL;DR
This paper demonstrates that a model-free reinforcement learning agent can autonomously design and control low-thrust satellite trajectories without prior knowledge of dynamics, achieving near-optimal guidance in uncertain environments.
Contribution
It introduces a dynamics-agnostic reinforcement learning approach for low-thrust orbital transfer, eliminating the need for precomputed data or detailed models.
Findings
Agent learns quasi-optimal guidance law
Responds well to environmental uncertainties
Enables autonomous trajectory control
Abstract
Low-thrust trajectory design and in-flight control remain two of the most challenging topics for new-generation satellite operations. Most of the solutions currently implemented are based on reference trajectories and lead to sub-optimal fuel usage. Other solutions are based on simple guidance laws that need to be updated periodically, increasing the cost of operations. Whereas some optimization strategies leverage Artificial Intelligence methods, all of the approaches studied so far need either previously generated data or a strong a priori knowledge of the satellite dynamics. This study uses model-free Reinforcement Learning to train an agent on a constrained pericenter raising scenario for a low-thrust medium-Earth-orbit satellite. The agent does not have any prior knowledge of the environment dynamics, which makes it unbiased from classical trajectory optimization patterns. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpace Satellite Systems and Control · Spacecraft Dynamics and Control · Optimization and Search Problems
