Toward Trusted Onboard AI: Advancing Small Satellite Operations using Reinforcement Learning
Cannon Whitney, Joseph Melville

TL;DR
This paper presents a reinforcement learning-based command automation system for a 3U CubeSat, demonstrating its potential for trusted, autonomous onboard control by using a digital twin for safe validation and real-time decision-making.
Contribution
It introduces a macro control action RL approach tailored for onboard satellite operations, enabling high-level decision making and trust-building in autonomous space systems.
Findings
RL agent successfully issued high-level commands in simulation and on orbit
Digital twin enabled safe validation of RL predictions
Onboard RL improved response times and reduced ground control reliance
Abstract
A RL (Reinforcement Learning) algorithm was developed for command automation onboard a 3U CubeSat. This effort focused on the implementation of macro control action RL, a technique in which an onboard agent is provided with compiled information based on live telemetry as its observation. The agent uses this information to produce high-level actions, such as adjusting attitude to solar pointing, which are then translated into control algorithms and executed through lower-level instructions. Once trust in the onboard agent is established, real-time environmental information can be leveraged for faster response times and reduced reliance on ground control. The approach not only focuses on developing an RL algorithm for a specific satellite but also sets a precedent for integrating trusted AI into onboard systems. This research builds on previous work in three areas: (1) RL algorithms for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
