Autonomous Reasoning for Spacecraft Control: A Large Language Model Framework with Group Relative Policy Optimization

Amit Jain; Richard Linares

arXiv:2601.04334·cs.RO·January 9, 2026

Autonomous Reasoning for Spacecraft Control: A Large Language Model Framework with Group Relative Policy Optimization

Amit Jain, Richard Linares

PDF

Open Access

TL;DR

This paper introduces a novel framework combining large language models with group relative policy optimization to develop autonomous spacecraft control policies that are both effective and interpretable across various dynamical systems.

Contribution

It presents a two-stage training approach integrating supervised fine-tuning and GRPO, enabling LLMs to generate feasible control policies with human-readable explanations.

Findings

01

Successfully applied to linear and nonlinear control problems

02

Generated interpretable control sequences and explanations

03

Demonstrated feasibility in complex spacecraft attitude control

Abstract

This paper presents a learning-based guidance-and-control approach that couples a reasoning-enabled Large Language Model (LLM) with Group Relative Policy Optimization (GRPO). A two-stage procedure consisting of Supervised Fine-Tuning (SFT) to learn formatting and control primitives, followed by GRPO for interaction-driven policy improvement, trains controllers for each environment. The framework is demonstrated on four control problems spanning a gradient of dynamical complexity, from canonical linear systems through nonlinear oscillatory dynamics to three-dimensional spacecraft attitude control with gyroscopic coupling and thrust constraints. Results demonstrate that an LLM with explicit reasoning, optimized via GRPO, can synthesize feasible stabilizing policies under consistent training settings across both linear and nonlinear systems. The two-stage training methodology enables…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Spacecraft Dynamics and Control