Power Allocation for Delay Optimization in Device-to-Device Networks: A Graph Reinforcement Learning Approach
Hao Fang, Kai Huang, Hao Ye, Chongtao Guo, Le Liang, Xiao Li, and Shi Jin

TL;DR
This paper introduces a graph neural network-based reinforcement learning method for power allocation in D2D networks, optimizing delay and fairness by leveraging topology and network state information.
Contribution
It proposes a novel GNN-enhanced RL framework with PPO for delay-aware power allocation, improving fairness and scalability in D2D communication networks.
Findings
Reduces average delay effectively
Ensures user fairness in power allocation
Outperforms baseline methods in simulations
Abstract
The pursuit of rate maximization in wireless communication frequently encounters substantial challenges associated with user fairness. This paper addresses these challenges by exploring a novel power allocation approach for delay optimization, utilizing graph neural networks (GNNs)-based reinforcement learning (RL) in device-to-device (D2D) communication. The proposed approach incorporates not only channel state information but also factors such as packet delay, the number of backlogged packets, and the number of transmitted packets into the components of the state information. We adopt a centralized RL method, where a central controller collects and processes the state information. The central controller functions as an agent trained using the proximal policy optimization (PPO) algorithm. To better utilize topology information in the communication network and enhance the generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced MIMO Systems Optimization · Energy Harvesting in Wireless Networks · Software-Defined Networks and 5G
MethodsADaptive gradient method with the OPTimal convergence rate · Entropy Regularization · Proximal Policy Optimization
