Plasticity-Enhanced Multi-Agent Mixture of Experts for Dynamic Objective Adaptation in UAVs-Assisted Emergency Communication Networks
Wen Qiu, Zhiqiang He, Wei Zhao, and Hiroshi Masui

TL;DR
This paper introduces PE-MAMoE, a multi-agent reinforcement learning framework with enhanced plasticity for UAV-based emergency communication, improving adaptability and performance in dynamic environments.
Contribution
It proposes a novel plasticity-enhanced mixture of experts architecture with a phase controller to maintain policy adaptability during environment shifts.
Findings
PE-MAMoE improves normalized interquartile mean return by 26.3% over baselines.
It increases served-user capacity by 12.8%.
Reduces collision rates by approximately 75%.
Abstract
Unmanned aerial vehicles serving as aerial base stations can rapidly restore connectivity after disasters, yet abrupt changes in user mobility and traffic demands shift the quality of service trade-offs and induce strong non-stationarity. Deep reinforcement learning policies suffer from plasticity loss under such shifts, as representation collapse and neuron dormancy impair adaptation. We propose plasticity enhanced multi-agent mixture of experts (PE-MAMoE), a centralized training with decentralized execution framework built on multi-agent proximal policy optimization. PE-MAMoE equips each UAV with a sparsely gated mixture of experts actor whose router selects a single specialist per step. A non-parametric Phase Controller injects brief, expert-only stochastic perturbations after phase switches, resets the action log-standard-deviation, anneals entropy and learning rate, and schedules…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
