ViT LoS V2X: Vision Transformers for Environment-aware LoS Blockage Prediction for 6G Vehicular Networks
Ghazi Gharsallah, Georges Kaddoum

TL;DR
This paper introduces a novel deep learning framework combining CNNs, Vision Transformers, and GRUs to predict line-of-sight blockages in 6G vehicular networks using multimodal data, achieving over 95% accuracy.
Contribution
It presents a new multimodal deep learning approach integrating CNNs, ViTs, and GRUs for environment-aware blockage prediction in vehicular networks, enhancing prediction accuracy.
Findings
Achieves over 95% prediction accuracy.
Outperforms existing state-of-the-art methods.
Effectively utilizes multimodal sensor data.
Abstract
As wireless communication technology progresses towards the sixth generation (6G), high-frequency millimeter-wave (mmWave) communication has emerged as a promising candidate for enabling vehicular networks. It offers high data rates and low-latency communication. However, obstacles such as buildings, trees, and other vehicles can cause signal attenuation and blockage, leading to communication failures that can result in fatal accidents or traffic congestion. Predicting blockages is crucial for ensuring reliable and efficient communications. Furthermore, the advent of 6G technology is anticipated to integrate advanced sensing capabilities, utilizing a variety of sensor types. These sensors, ranging from traditional RF sensors to cameras and Lidar sensors, are expected to provide access to rich multimodal data, thereby enriching communication systems with a wealth of additional contextual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
