Group Relative Policy Optimization for Robust Blind Interference Alignment with Fluid Antennas
Jianqiu Peng, Tong Zhang, Shuai Wang, Mingjie Shao, Hao Xu, and Rui Wang

TL;DR
This paper introduces a novel deep reinforcement learning framework, GRPO, for optimizing fluid antenna positions to enhance robust blind interference alignment in wireless systems with imperfect CSI.
Contribution
It proposes the first robust fluid antenna-driven BIA framework using GRPO, a new RL algorithm that improves performance and reduces computational complexity.
Findings
GRPO outperforms PPO by 4.17% in sum-rate maximization.
Pre-trained PPO with 100K steps improves by 30.29%.
GRPO exceeds heuristic methods by over 200%.
Abstract
Fluid antenna system (FAS) leverages dynamic reconfigurability to unlock spatial degrees of freedom and reshape wireless channels. Blind interference alignment (BIA) aligns interference through antenna switching. This paper proposes, for the first time, a robust fluid antenna-driven BIA framework for a K-user MISO downlink under imperfect channel state information (CSI). We formulate a robust sum-rate maximization problem through optimizing fluid antenna positions (switching positions). To solve this challenging non-convex problem, we employ group relative policy optimization (GRPO), a novel deep reinforcement learning algorithm that eliminates the critic network. This robust design reduces model size and floating point operations (FLOPs) by nearly half compared to proximal policy optimization (PPO) while significantly enhancing performance through group-based exploration that escapes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
