Autonomous Planning In-space Assembly Reinforcement-learning free-flYer (APIARY) International Space Station Astrobee Testing

Samantha Chapin; Kenneth Stewart; Roxana Leontie; and Carl Glen Henshaw

arXiv:2512.03729·cs.RO·April 1, 2026

Autonomous Planning In-space Assembly Reinforcement-learning free-flYer (APIARY) International Space Station Astrobee Testing

Samantha Chapin, Kenneth Stewart, Roxana Leontie, and Carl Glen Henshaw

PDF

TL;DR

This paper reports the first successful reinforcement learning control of a free-flying robot in space, demonstrating its potential to enhance autonomous space operations through simulation, ground, and on-orbit testing.

Contribution

It introduces a novel RL-based control policy for space robots, validated through simulation, ground, and actual spaceflight on the ISS, showcasing rapid deployment capabilities.

Findings

01

RL control policy achieved robust 6-DOF control in space.

02

Successful on-orbit validation of RL control of a free-flyer.

03

Demonstrated rapid development and deployment of space robot behaviors.

Abstract

The US Naval Research Laboratory's (NRL's) Autonomous Planning In-space Assembly Reinforcement-learning free-flYer (APIARY) experiment pioneers the use of reinforcement learning (RL) for control of free-flying robots in the zero-gravity (zero-G) environment of space. On Tuesday, May 27th 2025 the APIARY team conducted the first ever, to our knowledge, RL control of a free-flyer in space using the NASA Astrobee robot on-board the International Space Station (ISS). A robust 6-degrees of freedom (DOF) control policy was trained using an actor-critic Proximal Policy Optimization (PPO) network within the NVIDIA Isaac Lab simulation environment, randomizing over goal poses and mass distributions to enhance robustness. This paper details the simulation testing, ground testing, and flight validation of this experiment. This on-orbit demonstration validates the transformative potential of RL for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.