Power Allocation for Delay Optimization in Device-to-Device Networks: A Graph Reinforcement Learning Approach

Hao Fang; Kai Huang; Hao Ye; Chongtao Guo; Le Liang; Xiao Li; and Shi Jin

arXiv:2505.12902·eess.SY·May 20, 2025

Power Allocation for Delay Optimization in Device-to-Device Networks: A Graph Reinforcement Learning Approach

Hao Fang, Kai Huang, Hao Ye, Chongtao Guo, Le Liang, Xiao Li, and Shi Jin

PDF

Open Access

TL;DR

This paper introduces a graph neural network-based reinforcement learning method for power allocation in D2D networks, optimizing delay and fairness by leveraging topology and network state information.

Contribution

It proposes a novel GNN-enhanced RL framework with PPO for delay-aware power allocation, improving fairness and scalability in D2D communication networks.

Findings

01

Reduces average delay effectively

02

Ensures user fairness in power allocation

03

Outperforms baseline methods in simulations

Abstract

The pursuit of rate maximization in wireless communication frequently encounters substantial challenges associated with user fairness. This paper addresses these challenges by exploring a novel power allocation approach for delay optimization, utilizing graph neural networks (GNNs)-based reinforcement learning (RL) in device-to-device (D2D) communication. The proposed approach incorporates not only channel state information but also factors such as packet delay, the number of backlogged packets, and the number of transmitted packets into the components of the state information. We adopt a centralized RL method, where a central controller collects and processes the state information. The central controller functions as an agent trained using the proximal policy optimization (PPO) algorithm. To better utilize topology information in the communication network and enhance the generalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced MIMO Systems Optimization · Energy Harvesting in Wireless Networks · Software-Defined Networks and 5G

MethodsADaptive gradient method with the OPTimal convergence rate · Entropy Regularization · Proximal Policy Optimization