Multi-Phase Spacecraft Trajectory Optimization via Transformer-Based Reinforcement Learning
Amit Jain, Victor Rodriguez-Fernandez, Richard Linares

TL;DR
This paper presents a transformer-based reinforcement learning framework that unifies multi-phase spacecraft trajectory optimization, enabling adaptive, coherent control across different mission stages without manual phase transitions.
Contribution
The work introduces a novel transformer-augmented RL architecture that replaces traditional recurrent networks, allowing a single policy to handle multiple mission phases seamlessly.
Findings
Achieves near-optimal performance on simple benchmarks.
Successfully extends to complex multiphase rocket ascent tasks.
Maintains control stability and coherence across diverse mission regimes.
Abstract
Autonomous spacecraft control for mission phases such as launch, ascent, stage separation, and orbit insertion remains a critical challenge due to the need for adaptive policies that generalize across dynamically distinct regimes. While reinforcement learning (RL) has shown promise in individual astrodynamics tasks, existing approaches often require separate policies for distinct mission phases, limiting adaptability and increasing operational complexity. This work introduces a transformer-based RL framework that unifies multi-phase trajectory optimization through a single policy architecture, leveraging the transformer's inherent capacity to model extended temporal contexts. Building on proximal policy optimization (PPO), our framework replaces conventional recurrent networks with a transformer encoder-decoder structure, enabling the agent to maintain coherent memory across mission…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpacecraft Dynamics and Control · Space Satellite Systems and Control · Adaptive Dynamic Programming Control
