Learning Decentralized Routing Policies via Graph Attention-based Multi-Agent Reinforcement Learning in Lunar Delay-Tolerant Networks

Lozano Cuadra, Federico; Soret, Beatriz; Sánchez Net, Marc; Cauligi, Abhishek; Rossi, Federico

dc.centro	E.T.S.I. Telecomunicación
dc.contributor.author	Lozano Cuadra, Federico
dc.contributor.author	Soret, Beatriz
dc.contributor.author	Sánchez Net, Marc
dc.contributor.author	Cauligi, Abhishek
dc.contributor.author	Rossi, Federico
dc.date.accessioned	2026-02-11T11:56:25Z
dc.date.issued	2025-10-23
dc.departamento	Ingeniería de Comunicaciones
dc.description.abstract	We present a fully decentralized routing framework for multi-robot exploration missions operating under the constraints of a Lunar Delay-Tolerant Network (LDTN). In this setting, autonomous rovers must relay collected data to a lander under intermittent connectivity and unknown mobility patterns. We formulate the problem as a Partially Observable Markov Decision Process (POMDP) and propose a Graph Attention-based Multi-Agent Reinforcement Learning (GAT-MARL) policy trained under the Centralized Training, Decentralized Execution (CTDE) paradigm. Unlike classical approaches such as shortest-path and controlled flooding-based algorithms, our method relies only on local observations and requires neither global topology updates nor packet replication. In Monte Carlo simulations of randomized exploration environments, GAT-MARL achieves higher delivery rates, no packet duplication, and fewer packet losses, and it can leverage short-term mobility forecasts. Successful generalization to larger rover teams indicates that the approach offers a scalable solution for future robotic planetary-exploration systems.
dc.description.sponsorship	Ministerio de Ciencia, Innovación y Universidades
dc.description.sponsorship	ERDF: A way of making Europe
dc.description.sponsorship	National Aeronautics and Space Administration (NASA)
dc.identifier.other	https://arxiv.org/pdf/2510.20436
dc.identifier.uri	https://hdl.handle.net/10630/45373
dc.language.iso	eng
dc.relation.eventdate	1-4 December 2025
dc.relation.eventplace	Sendai, Japan
dc.relation.eventtitle	International Conference on Space Robotics 2025 (iSpaRo)
dc.rights	Attribution 4.0 International	en
dc.rights.accessRights	open access
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	Machine learning
dc.subject	Astronautics - Communication systems
dc.subject.other	Delay tolerant networks
dc.subject.other	Reinforcement learning
dc.subject.other	Multi agent
dc.subject.other	Graph attention networks
dc.title	Learning Decentralized Routing Policies via Graph Attention-based Multi-Agent Reinforcement Learning in Lunar Delay-Tolerant Networks
dc.type	conference output
dspace.entity.type	Publication

Files

Original bundle

Name: 2510.20436v1.pdf
Size: 1.61 MB
Format: Adobe Portable Document Format

Collections

Conference Papers, Communications, and Posters