Spatiotemporal Continual Learning for Mobile Edge UAV Networks: Mitigating Catastrophic Forgetting
Chuan-Chi Lai

TL;DR
This paper introduces a spatiotemporal continual learning framework for UAV networks that mitigates catastrophic forgetting and maintains high performance under dynamic, high-density conditions.
Contribution
It proposes the G-MAPPO algorithm with gradient orthogonalization and spatial compensation to enhance resilience and scalability in mobile edge UAV networks.
Findings
Service reliability exceeds 0.9 for up to 100 users.
G-MAPPO outperforms MADDPG under extreme saturation.
Achieves 20% capacity gain under high traffic loads.
Abstract
This paper addresses catastrophic forgetting in mobile edge UAV networks within dynamic spatiotemporal environments. Conventional deep reinforcement learning often fails during task transitions, necessitating costly retraining to adapt to new user distributions. We propose the spatiotemporal continual learning (STCL) framework, realized through the group-decoupled multi-agent proximal policy optimization (G-MAPPO) algorithm. The core innovation lies in the integration of a group-decoupled policy optimization (GDPO) mechanism with a gradient orthogonalization layer to balance heterogeneous objectives including energy efficiency, user fairness, and coverage. This combination employs dynamic z-score normalization and gradient projection to mitigate conflicts without offline resets. Furthermore, 3D UAV mobility serves as a spatial compensation layer to manage extreme density shifts.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
