Segment-Wise Flow Matching for Vision-Aided mmWave V2I Beam Prediction

Can Zheng; Jiguang He; Chung G. Kang; Guofa Cai; Chongwen Huang; Henk Wymeersch

arXiv:2511.20265·eess.SP·April 14, 2026

Segment-Wise Flow Matching for Vision-Aided mmWave V2I Beam Prediction

Can Zheng, Jiguang He, Chung G. Kang, Guofa Cai, Chongwen Huang, Henk Wymeersch

PDF

TL;DR

This paper introduces a flow matching framework conditioned on vision data for millimeter-wave V2I beam prediction, enabling smooth, efficient, and accurate beam state forecasting.

Contribution

It presents a novel continuous flow matching approach that models beam dynamics directly, improving prediction accuracy and reducing inference latency in vehicle-to-infrastructure communication.

Findings

01

Significantly outperforms baseline beam prediction methods.

02

Approaches the performance of large language model-based techniques.

03

Reduces predictor-side inference latency by approximately 6.9 times on GPU and 2800 times on CPU.

Abstract

This paper proposes a vision-conditioned flow matching (FM) framework for beam prediction in millimeter-wave vehicle-to-infrastructure links. Instead of modeling discrete beam-index sequences, the proposed method learns the temporal evolution of normalized beam receive power vectors through a continuous vector field governed by an ordinary differential equation, enabling smooth dynamics and efficient sampling. By imposing FM over beam-state transitions and jointly optimizing beam prediction and flow consistency, the proposed framework provides a unified model for future beam prediction. Experimental results show that the proposed FM-based model significantly improves beam prediction performance over baselines, approaches the performance of large language model-based methods, and reduces predictor-side inference latency by about $6.9 \times$ on GPU and $2.8 \times 1 0^{3} \times$ on CPU,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.