Deep Deterministic Policy Gradient to Minimize the Age of Information in Cellular V2X Communications
Zoubeir Mlika, Soumaya Cherkaoui

TL;DR
This paper introduces a novel deep reinforcement learning approach with a decomposition technique to minimize the age of information in cellular V2X communications, achieving significant performance improvements over benchmarks.
Contribution
It proposes a combined matching and deep reinforcement learning method to efficiently optimize resource allocation for AoI minimization in V2X networks.
Findings
Achieves up to 66% performance gain over benchmarks.
Successfully minimizes AoI across various vehicle speeds and transmission parameters.
Identifies an optimal broadcast coverage value for best AoI performance.
Abstract
This paper studies the problem of minimizing the age of information (AoI) in cellular vehicle-to-everything communications. To provide minimal AoI and high reliability for vehicles' safety information, NOMA is exploited. We reformulate a resource allocation problem that involves half-duplex transceiver selection, broadcast coverage optimization, power allocation, and resource block scheduling. First, to obtain the optimal solution, we formulate the problem as a mixed-integer nonlinear programming problem and then study its NP-hardness. The NP-hardness result motivates us to design simple solutions. Consequently, we model the problem as a single-agent Markov decision process to solve the problem efficiently using fingerprint deep reinforcement learning techniques such as deep-Q-network (DQN) methods. Nevertheless, applying DQN is not straightforward due to the curse of dimensionality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
