Environment-Aware Channel Prediction for Vehicular Communications: A Multimodal Visual Feature Fusion Framework

Xuejian Zhang; Ruisi He; Minseok Kim; Inocent Calist; Mi Yang; Ziyi Qi

arXiv:2604.02396·cs.CV·April 6, 2026

Environment-Aware Channel Prediction for Vehicular Communications: A Multimodal Visual Feature Fusion Framework

Xuejian Zhang, Ruisi He, Minseok Kim, Inocent Calist, Mi Yang, Ziyi Qi

PDF

TL;DR

This paper introduces a multimodal visual feature fusion framework for environment-aware channel prediction in vehicular communications, leveraging GPS, panoramic images, and semantic data to improve accuracy and robustness.

Contribution

It proposes a novel three-branch deep learning architecture with adaptive multimodal fusion and specialized regression heads for comprehensive channel parameter prediction.

Findings

01

Achieved RMSE of 3.26 dB for path loss prediction.

02

Predicted delay spread with RMSE of 37.66 ns.

03

Demonstrated high cosine similarity (>0.93) for APS prediction.

Abstract

The deep integration of communication with intelligence and sensing, as a defining vision of 6G, renders environment-aware channel prediction a key enabling technology. As a representative 6G application, vehicular communications require accurate and forward-looking channel prediction under stringent reliability, latency, and adaptability demands. Traditional empirical and deterministic models remain limited in balancing accuracy, generalization, and deployability, while the growing availability of onboard and roadside sensing devices offers a promising source of environmental priors. This paper proposes an environment-aware channel prediction framework based on multimodal visual feature fusion. Using GPS data and vehicle-side panoramic RGB images, together with semantic segmentation and depth estimation, the framework extracts semantic, depth, and position features through a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.