Autonomous Reasoning for Spacecraft Control: A Large Language Model Framework with Group Relative Policy Optimization
Amit Jain, Richard Linares

TL;DR
This paper introduces a novel framework combining large language models with group relative policy optimization to develop autonomous spacecraft control policies that are both effective and interpretable across various dynamical systems.
Contribution
It presents a two-stage training approach integrating supervised fine-tuning and GRPO, enabling LLMs to generate feasible control policies with human-readable explanations.
Findings
Successfully applied to linear and nonlinear control problems
Generated interpretable control sequences and explanations
Demonstrated feasibility in complex spacecraft attitude control
Abstract
This paper presents a learning-based guidance-and-control approach that couples a reasoning-enabled Large Language Model (LLM) with Group Relative Policy Optimization (GRPO). A two-stage procedure consisting of Supervised Fine-Tuning (SFT) to learn formatting and control primitives, followed by GRPO for interaction-driven policy improvement, trains controllers for each environment. The framework is demonstrated on four control problems spanning a gradient of dynamical complexity, from canonical linear systems through nonlinear oscillatory dynamics to three-dimensional spacecraft attitude control with gyroscopic coupling and thrust constraints. Results demonstrate that an LLM with explicit reasoning, optimized via GRPO, can synthesize feasible stabilizing policies under consistent training settings across both linear and nonlinear systems. The two-stage training methodology enables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Spacecraft Dynamics and Control
