Learning Value of Information towards Joint Communication and Control in 6G V2X
Lei Lei, Kan Zheng, Xuemin (Sherman) Shen

TL;DR
This paper introduces a new stochastic decision process model to quantify the value of information, aiming to optimize joint communication and control in 6G vehicle networks using reinforcement learning.
Contribution
It proposes the SSDP model as a general framework for evaluating VoI, extending beyond traditional MDPs, and integrates it with RL and optimal control for CAV decision-making.
Findings
SSDP model explicitly represents information sets for decision-making.
Framework unifies VoI estimation methods within RL and control theories.
Illustrated with vehicle-following control, showing potential for broader networked control systems.
Abstract
As Cellular Vehicle-to-Everything (C-V2X) evolves towards future sixth-generation (6G) networks, Connected Autonomous Vehicles (CAVs) are emerging to become a key application. Leveraging data-driven Machine Learning (ML), especially Deep Reinforcement Learning (DRL), is expected to significantly enhance CAV decision-making in both vehicle control and V2X communication under uncertainty. These two decision-making processes are closely intertwined, with the value of information (VoI) acting as a crucial bridge between them. In this paper, we introduce Sequential Stochastic Decision Process (SSDP) models to define and assess VoI, demonstrating their application in optimizing communication systems for CAVs. Specifically, we formally define the SSDP model and demonstrate that the MDP model is a special case of it. The SSDP model offers a key advantage by explicitly representing the set of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicular Ad Hoc Networks (VANETs) · Traffic control and management · Age of Information Optimization
MethodsSparse Evolutionary Training
