Meta-Reinforcement Learning Based Resource Allocation for Dynamic V2X   Communications

Yi Yuan; Gan Zheng; Kai-Kit Wong; Khaled B. Letaief

arXiv:2110.07734·cs.IT·October 18, 2021

Meta-Reinforcement Learning Based Resource Allocation for Dynamic V2X Communications

Yi Yuan, Gan Zheng, Kai-Kit Wong, Khaled B. Letaief

PDF

TL;DR

This paper introduces a meta-reinforcement learning approach for resource allocation in V2X communications, enabling fast adaptation and improved performance in dynamic vehicular environments.

Contribution

It develops a meta-based DRL algorithm that enhances adaptability and a deep RL framework combining DQN and DDPG for resource allocation in V2X networks.

Findings

01

Meta-DRL achieves rapid adaptation with limited data.

02

Proposed algorithms outperform quantized power approaches.

03

Significant performance improvements demonstrated in simulations.

Abstract

This paper studies the allocation of shared resources between vehicle-to-infrastructure (V2I) and vehicle-to-vehicle (V2V) links in vehicle-to-everything (V2X) communications. In existing algorithms, dynamic vehicular environments and quantization of continuous power become the bottlenecks for providing an effective and timely resource allocation policy. In this paper, we develop two algorithms to deal with these difficulties. First, we propose a deep reinforcement learning (DRL)-based resource allocation algorithm to improve the performance of both V2I and V2V links. Specifically, the algorithm uses deep Q-network (DQN) to solve the sub-band assignment and deep deterministic policy-gradient (DDPG) to solve the continuous power allocation problem. Second, we propose a meta-based DRL algorithm to enhance the fast adaptability of the resource allocation policy in the dynamic environment.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.