Multi-Modal Decentralized Reinforcement Learning for Modular Reconfigurable Lunar Robots

Ashutosh Mishra; Shreya Santra; Elian Neppel; Edoardo M. Rossi Lombardi; Shamistan Karimov; Kentaro Uno; and Kazuya Yoshida

arXiv:2510.20347·cs.RO·October 24, 2025

Multi-Modal Decentralized Reinforcement Learning for Modular Reconfigurable Lunar Robots

Ashutosh Mishra, Shreya Santra, Elian Neppel, Edoardo M. Rossi Lombardi, Shamistan Karimov, Kentaro Uno, and Kazuya Yoshida

PDF

Open Access

TL;DR

This paper introduces a decentralized reinforcement learning framework enabling modular lunar robots to autonomously learn locomotion and manipulation policies, demonstrating high success rates and efficiency in simulation and field tests for reconfigurable space tasks.

Contribution

It presents a novel decentralized RL scheme allowing each module to learn its own policy, facilitating zero-shot generalization and scalable control of reconfigurable lunar robots.

Findings

01

Steering policy achieved 3.63° mean absolute error.

02

Manipulation success rate reached 84.6%.

03

Wheel policy reduced motor torque by 95.4%.

Abstract

Modular reconfigurable robots suit task-specific space operations, but the combinatorial growth of morphologies hinders unified control. We propose a decentralized reinforcement learning (Dec-RL) scheme where each module learns its own policy: wheel modules use Soft Actor-Critic (SAC) for locomotion and 7-DoF limbs use Proximal Policy Optimization (PPO) for steering and manipulation, enabling zero-shot generalization to unseen configurations. In simulation, the steering policy achieved a mean absolute error of 3.63{\deg} between desired and induced angles; the manipulation policy plateaued at 84.6 % success on a target-offset criterion; and the wheel policy cut average motor torque by 95.4 % relative to baseline while maintaining 99.6 % success. Lunar-analogue field tests validated zero-shot integration for autonomous locomotion, steering, and preliminary alignment for reconfiguration.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModular Robots and Swarm Intelligence · Space Satellite Systems and Control · Micro and Nano Robotics