Efficient Beam Selection for ISAC in Cell-Free Massive MIMO via Digital Twin-Assisted Deep Reinforcement Learning
Jiexin Zhang, Shu Xu, Chunguo Li, Yongming Huang, and Luxi Yang

TL;DR
This paper introduces a digital twin-assisted deep reinforcement learning framework for efficient beam selection in cell-free ISAC systems, significantly reducing online interaction costs while maintaining high performance.
Contribution
The work develops a novel offline DRL approach using a cGAN-based digital twin to improve beam selection in ISAC, addressing real-time interaction challenges.
Findings
Reduces online interaction overhead in beam selection.
Maintains high detection performance under various conditions.
Theoretically guarantees convergence and bounds of the proposed method.
Abstract
Beamforming enhances signal strength and quality by focusing energy in specific directions. This capability is particularly crucial in cell-free integrated sensing and communication (ISAC) systems, where multiple distributed access points (APs) collaborate to provide both communication and sensing services. In this work, we first derive the distribution of joint target detection probabilities across multiple receiving APs under false alarm rate constraints, and then formulate the beam selection procedure as a Markov decision process (MDP). We establish a deep reinforcement learning (DRL) framework, in which reward shaping and sinusoidal embedding are introduced to facilitate agent learning. To eliminate the high costs and associated risks of real-time agent-environment interactions, we further propose a novel digital twin (DT)-assisted offline DRL approach. Different from traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
