Learning Decentralized Routing Policies via Graph Attention-based Multi-Agent Reinforcement Learning in Lunar Delay-Tolerant Networks

Federico Lozano-Cuadra, Beatriz Soret, Marc Sanchez Net, Abhishek Cauligi, and Federico Rossi

arXiv:2510.20436 · stat.ML · October 24, 2025

TL;DR

This paper introduces a decentralized multi-agent reinforcement learning approach that uses graph attention to route data in lunar delay-tolerant networks. It achieves higher delivery rates than classical shortest-path and controlled-flooding approaches without requiring global topology knowledge.

Contribution

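The paper proposes GAT-MARL, a routing framework that is trained centrally and executed in a decentralized manner using only local observations. Unlike classical shortest-path and controlled-flooding algorithms, it requires neither global topology updates nor packet replication, and it generalizes to larger rover teams.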

Findings

01 Higher delivery rates than classical shortest-path and controlled-flooding approaches

02 No packet duplication and fewer packet losses in simulations

03 Effective generalization to larger rover teams

Abstract

We present a fully decentralized routing framework for multi-robot exploration missions operating under the constraints of a Lunar Delay-Tolerant Network (LDTN). In this setting, autonomous rovers must relay collected data to a lander under intermittent connectivity and unknown mobility patterns. We formulate the problem as a Partially Observable Markov Decision Problem (POMDP) and propose a Graph Attention-based Multi-Agent Reinforcement Learning (GAT-MARL) policy that performs Centralized Training, Decentralized Execution (CTDE). Our method relies only on local observations and does not require global topology updates or packet replication, unlike classical approaches such as shortest path and controlled flooding-based algorithms. Through Monte Carlo simulations in randomized exploration environments, GAT-MARL provides higher delivery rates, no duplications, and fewer packet losses,…
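
To make the idea of routing from local observations concrete, the following is a minimal sketch of how a single rover could score its current neighbors with the standard graph attention formulation, e_j = LeakyReLU(a · [W h_i || W h_j]) followed by a softmax over neighbors. It is not the authors' architecture or training procedure. The feature layout (`[queue_fill, distance_to_lander]`), the hand-picked weights `W_DEMO` and `A_DEMO`, the rover names, and the rule of forwarding to the highest-weighted neighbor are all assumptions made for illustration; in a trained policy these parameters would be learned and the attention output would feed further layers.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x

def linear(W, x):
    """Matrix-vector product with W given as a list of rows."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def attention_weights(W, a, self_feat, neighbor_feats):
    """Single-head GAT-style attention of one node over its neighbors."""
    hi = linear(W, self_feat)
    scores = []
    for feat in neighbor_feats:
        hj = linear(W, feat)
        concat = hi + hj  # list concatenation: [W h_i || W h_j]
        scores.append(leaky_relu(sum(ak * ck for ak, ck in zip(a, concat))))
    return softmax(scores)

def choose_next_hop(W, a, self_feat, neighbors):
    """Pick one neighbor for a single-copy handover (no replication).

    neighbors maps a neighbor name to its locally observed feature list.
    Using the attention weights directly as the forwarding preference is
    an illustrative simplification, not the method from the paper.
    """
    names = sorted(neighbors)
    alphas = attention_weights(W, a, self_feat, [neighbors[n] for n in names])
    best = max(range(len(names)), key=lambda k: alphas[k])
    return names[best], dict(zip(names, alphas))

# Hypothetical hand-picked parameters: identity projection, and an attention
# vector that penalizes neighbors with full queues or far from the lander.
W_DEMO = [[1.0, 0.0], [0.0, 1.0]]
A_DEMO = [0.0, 0.0, -1.0, -1.0]

if __name__ == "__main__":
    hop, weights = choose_next_hop(
        W_DEMO, A_DEMO,
        self_feat=[0.5, 0.5],
        neighbors={"rover_b": [0.2, 0.3], "rover_c": [0.9, 0.8]},
    )
    print(hop, weights)
```

Only the features of nodes currently in contact are used, so no global topology is needed, and the packet is handed to exactly one neighbor rather than copied, which is the property that separates this style of forwarding from flooding.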

Taxonomy

Topics: Opportunistic and Delay-Tolerant Networks · Distributed Control Multi-Agent Systems · Spacecraft Dynamics and Control