Segment-Wise Flow Matching for Vision-Aided mmWave V2I Beam Prediction
Can Zheng, Jiguang He, Chung G. Kang, Guofa Cai, Chongwen Huang, Henk Wymeersch

TL;DR
This paper introduces a flow matching framework conditioned on vision data for millimeter-wave V2I beam prediction, enabling smooth, efficient, and accurate beam state forecasting.
Contribution
It presents a novel continuous flow matching approach that models beam dynamics directly, improving prediction accuracy and reducing inference latency in vehicle-to-infrastructure communication.
Findings
Significantly outperforms baseline beam prediction methods.
Approaches the performance of large language model-based techniques.
Reduces predictor-side inference latency by approximately 6.9 times on GPU and 2800 times on CPU.
Abstract
This paper proposes a vision-conditioned flow matching (FM) framework for beam prediction in millimeter-wave vehicle-to-infrastructure links. Instead of modeling discrete beam-index sequences, the proposed method learns the temporal evolution of normalized beam receive power vectors through a continuous vector field governed by an ordinary differential equation, enabling smooth dynamics and efficient sampling. By imposing FM over beam-state transitions and jointly optimizing beam prediction and flow consistency, the proposed framework provides a unified model for future beam prediction. Experimental results show that the proposed FM-based model significantly improves beam prediction performance over baselines, approaches the performance of large language model-based methods, and reduces predictor-side inference latency by about on GPU and on CPU,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
