Federated Reinforcement Learning to Optimize Teleoperated Driving Networks
Filippo Bragato, Marco Giordani, Michele Zorzi

TL;DR
This paper investigates federated reinforcement learning algorithms to optimize predictive quality of service for teleoperated driving in 6G networks, balancing latency and data compression quality.
Contribution
It compares various RL algorithms in a federated setup for PQoS, identifying Q-Learning as the most effective for teleoperated driving scenarios.
Findings
Q-Learning achieves the best average reward and convergence.
Federated setup improves convergence time and fairness.
Q-Learning has low computational cost.
Abstract
Several sixth generation (6G) use cases have tight requirements in terms of reliability and latency, in particular teleoperated driving (TD). To address those requirements, Predictive Quality of Service (PQoS), possibly combined with reinforcement learning (RL), has emerged as a valid approach to dynamically adapt the configuration of the TD application (e.g., the level of compression of automotive data) to the experienced network conditions. In this work, we explore different classes of RL algorithms for PQoS, namely MAB (stateless), SARSA (stateful on-policy), Q-Learning (stateful off-policy), and DSARSA and DDQN (with Neural Network (NN) approximation). We trained the agents in a federated learning (FL) setup to improve the convergence time and fairness, and to promote privacy and security. The goal is to optimize the trade-off between Quality of Service (QoS), measured in terms of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management
