Toward Trusted Onboard AI: Advancing Small Satellite Operations using Reinforcement Learning

Cannon Whitney; Joseph Melville

arXiv:2507.22198·eess.SY·July 31, 2025

Toward Trusted Onboard AI: Advancing Small Satellite Operations using Reinforcement Learning

Cannon Whitney, Joseph Melville

PDF

TL;DR

This paper presents a reinforcement learning-based command automation system for a 3U CubeSat, demonstrating its potential for trusted, autonomous onboard control by using a digital twin for safe validation and real-time decision-making.

Contribution

It introduces a macro control action RL approach tailored for onboard satellite operations, enabling high-level decision making and trust-building in autonomous space systems.

Findings

01

RL agent successfully issued high-level commands in simulation and on orbit

02

Digital twin enabled safe validation of RL predictions

03

Onboard RL improved response times and reduced ground control reliance

Abstract

A RL (Reinforcement Learning) algorithm was developed for command automation onboard a 3U CubeSat. This effort focused on the implementation of macro control action RL, a technique in which an onboard agent is provided with compiled information based on live telemetry as its observation. The agent uses this information to produce high-level actions, such as adjusting attitude to solar pointing, which are then translated into control algorithms and executed through lower-level instructions. Once trust in the onboard agent is established, real-time environmental information can be leveraged for faster response times and reduced reliance on ground control. The approach not only focuses on developing an RL algorithm for a specific satellite but also sets a precedent for integrating trusted AI into onboard systems. This research builds on previous work in three areas: (1) RL algorithms for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.