Provable Performance Bounds for Digital Twin-driven Deep Reinforcement Learning in Wireless Networks: A Novel Digital-Twin Bisimulation Metric
Zhenyu Tao, Wei Xu, Xiaohu You

TL;DR
This paper introduces a new Wasserstein-based bisimulation metric for digital twin-driven deep reinforcement learning in wireless networks, providing provable bounds on real-world performance and addressing computational challenges.
Contribution
It proposes the DT bisimulation metric (DT-BSM), proves bounds on policy sub-optimality, and develops an empirical sampling method with convergence guarantees.
Findings
The DT-BSM bounds real-world policy performance.
The empirical DT-BSM converges to the theoretical metric.
Numerical experiments validate the performance bounds.
Abstract
Digital twin (DT)-driven deep reinforcement learning (DRL) has emerged as a promising paradigm for wireless network optimization, offering safe and efficient training environment for policy exploration. However, in theory existing methods cannot always guarantee real-world performance of DT-trained policies before actual deployment, due to the absence of a universal metric for assessing DT's ability to support reliable DRL training transferrable to physical networks. In this paper, we propose the DT bisimulation metric (DT-BSM), a novel metric based on the Wasserstein distance, to quantify the discrepancy between Markov decision processes (MDPs) in both the DT and the corresponding real-world wireless network environment. We prove that for any DT-trained policy, the sub-optimality of its performance (regret) in the real-world deployment is bounded by a weighted sum of the DT-BSM and its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMuscle activation and electromyography studies · Advancements in Semiconductor Devices and Circuit Design · Neuroscience and Neural Engineering
