Learning Decentralized Routing Policies via Graph Attention-based Multi-Agent Reinforcement Learning in Lunar Delay-Tolerant Networks

dc.centroE.T.S.I. Telecomunicación
dc.contributor.authorLozano Cuadra, Federico
dc.contributor.authorSoret, Beatriz
dc.contributor.authorSánchez Net, Marc
dc.contributor.authorCauligi, Abhishek
dc.contributor.authorRossi, Federico
dc.date.accessioned2026-02-11T11:56:25Z
dc.date.issued2025-10-23
dc.departamentoIngeniería de Comunicaciones
dc.description.abstractWe present a fully decentralized routing framework for multi-robot exploration missions operating under the constraints of a Lunar Delay-Tolerant Network (LDTN). In this setting, autonomous rovers must relay collected data to a lander under intermittent connectivity and unknown mobility patterns. We formulate the problem as a Partially Observable Markov Decision Problem (POMDP) and propose a Graph Attention-based Multi-Agent Reinforcement Learning (GAT-MARL) policy that performs Centralized Training, Decentralized Execution (CTDE). Our method relies only on local observations and does not require global topology updates or packet replication, unlike classical approaches such as shortest path and controlled flooding-based algorithms. Through Monte Carlo simulations in randomized exploration environments, GAT-MARL provides higher delivery rates, no duplications, and fewer packet losses, and is able to leverage short-term mobility forecasts; offering a scalable solution for future space robotic systems for planetary exploration, as demonstrated by successful generalization to larger rover teams.
dc.description.sponsorshipMinisterio de Ciencia, Innovación y Universidades
dc.description.sponsorshipERDD: A way of making Europe
dc.description.sponsorshipNational Aeronautics and Space Administration (NASA)
dc.identifier.otherhttps://arxiv.org/pdf/2510.20436
dc.identifier.urihttps://hdl.handle.net/10630/45373
dc.language.isoeng
dc.relation.eventdate1-4 Diciembre 2025
dc.relation.eventplaceSendai, Japon
dc.relation.eventtitleInternational Conference on Space Robotics 2025 (iSpaRo)
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectAprendizaje automático
dc.subjectAstronáutica - Sistemas de comunicaciones
dc.subject.otherDelay tolerant networks
dc.subject.otherReinforcement learning
dc.subject.otherMulti agent
dc.subject.otherGraph attention networks
dc.titleLearning Decentralized Routing Policies via Graph Attention-based Multi-Agent Reinforcement Learning in Lunar Delay-Tolerant Networks
dc.typeconference output
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2510.20436v1.pdf
Size:
1.61 MB
Format:
Adobe Portable Document Format