Meta-Reinforcement Learning Based Resource Allocation for Dynamic V2X Communications
Yi Yuan, Gan Zheng, Kai-Kit Wong, Khaled B. Letaief

TL;DR
This paper introduces a meta-reinforcement learning approach for resource allocation in V2X communications, enabling fast adaptation and improved performance in dynamic vehicular environments.
Contribution
It develops a meta-based DRL algorithm that enhances adaptability and a deep RL framework combining DQN and DDPG for resource allocation in V2X networks.
Findings
Meta-DRL achieves rapid adaptation with limited data.
Proposed algorithms outperform quantized power approaches.
Significant performance improvements demonstrated in simulations.
Abstract
This paper studies the allocation of shared resources between vehicle-to-infrastructure (V2I) and vehicle-to-vehicle (V2V) links in vehicle-to-everything (V2X) communications. In existing algorithms, dynamic vehicular environments and quantization of continuous power become the bottlenecks for providing an effective and timely resource allocation policy. In this paper, we develop two algorithms to deal with these difficulties. First, we propose a deep reinforcement learning (DRL)-based resource allocation algorithm to improve the performance of both V2I and V2V links. Specifically, the algorithm uses deep Q-network (DQN) to solve the sub-band assignment and deep deterministic policy-gradient (DDPG) to solve the continuous power allocation problem. Second, we propose a meta-based DRL algorithm to enhance the fast adaptability of the resource allocation policy in the dynamic environment.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
