Reinforcement Learning for Joint V2I Network Selection and Autonomous Driving Policies
Zijiang Yan, Hina Tabassum

TL;DR
This paper introduces a reinforcement learning framework that optimizes autonomous vehicle driving and network selection policies simultaneously, improving safety, traffic flow, and communication data rates in multi-band vehicular networks.
Contribution
It develops a deep Q-learning based approach to jointly optimize vehicle motion and network choices, addressing uncertainties in V2I communication and traffic conditions.
Findings
Enhanced vehicle safety and connectivity through RL policies.
Improved data rates and reduced handoffs in multi-band V2I networks.
Insights into the interplay between vehicle dynamics and communication performance.
Abstract
Vehicle-to-Infrastructure (V2I) communication is becoming critical for the enhanced reliability of autonomous vehicles (AVs). However, the uncertainties in the road-traffic and AVs' wireless connections can severely impair timely decision-making. It is thus critical to simultaneously optimize the AVs' network selection and driving policies in order to minimize road collisions while maximizing the communication data rates. In this paper, we develop a reinforcement learning (RL) framework to characterize efficient network selection and autonomous driving policies in a multi-band vehicular network (VNet) operating on conventional sub-6GHz spectrum and Terahertz (THz) frequencies. The proposed framework is designed to (i) maximize the traffic flow and minimize collisions by controlling the vehicle's motion dynamics (i.e., speed and acceleration) from autonomous driving perspective, and (ii)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicular Ad Hoc Networks (VANETs) · Millimeter-Wave Propagation and Modeling
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Q-Learning
