The Cost of Learning: Efficiency vs. Efficacy of Learning-Based RRM for 6G
Seyyidahmed Lahmer, Federico Chiariotti, Andrea Zanella

TL;DR
This paper explores the trade-off between learning efficiency and efficacy in 6G resource management, proposing a dynamic strategy that balances data exchange overhead with convergence speed, improving overall system performance.
Contribution
It introduces a dynamic balancing approach for DRL-based RRM in 6G, optimizing the trade-off between learning overhead and convergence speed, which outperforms static methods.
Findings
The proposed method converges faster to optimal policies.
It reduces learning overhead compared to static strategies.
Simulation results validate improved QoS maintenance.
Abstract
In the past few years, Deep Reinforcement Learning (DRL) has become a valuable solution to automatically learn efficient resource management strategies in complex networks. In many scenarios, the learning task is performed in the Cloud, while experience samples are generated directly by edge nodes or users. Therefore, the learning task involves some data exchange which, in turn, subtracts a certain amount of transmission resources from the system. This creates a friction between the need to speed up convergence towards an effective strategy, which requires the allocation of resources to transmit learning samples, and the need to maximize the amount of resources used for data plane communication, maximizing users' Quality of Service (QoS), which requires the learning process to be efficient, i.e., minimize its overhead. In this paper, we investigate this trade-off and propose a dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · IoT and Edge/Fog Computing · Advanced MIMO Systems Optimization
Methodstravel james · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
