Green Emergency Communications in RIS- and MA-Assisted Multi-UAV SAGINs: A Partially Observable Reinforcement Learning Approach
Liangshun Wu, Wen Chen, Shunqing Zhang, Yajun Wang, Kunlun Wang

TL;DR
This paper introduces a novel reinforcement learning approach for UAVs in disaster recovery SAGINs, effectively handling partial observability and improving communication efficiency and network coverage in urban environments.
Contribution
It proposes a spatiotemporal A2C method with local messaging and belief encoding, addressing partial observability and outperforming existing MARL techniques.
Findings
Outperforms existing MARL methods in convergence speed and reward.
Reduces TD and advantage errors significantly.
Achieves better throughput-energy trade-off.
Abstract
In post-disaster space-air-ground integrated networks (SAGINs), terrestrial infrastructure is often impaired, and unmanned aerial vehicles (UAVs) must rapidly restore connectivity for mission-critical ground terminals in cluttered non-line-of-sight (NLoS) urban environments. To enhance coverage, UAVs employ movable antennas (MAs), while reconfigurable intelligent surfaces (RISs) on surviving high-rises redirect signals. The key challenge is communication-limited partial observability, leaving each UAV with a narrow, fast-changing neighborhood view that destabilizes value estimation. Existing multi-agent reinforcement learning (MARL) approaches are inadequate--non-communication methods rely on unavailable global critics, heuristic sharing is brittle and redundant, and learnable protocols (e.g., CommNet, DIAL) lose per-neighbor structure and aggravate non-stationarity under tight…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Advanced Wireless Communication Technologies · Advanced MIMO Systems Optimization
