Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges
Yu Tian, Gaofeng Pan, Mohamed-Slim Alouini

TL;DR
This paper explores how deep learning-based computer vision techniques can enhance wireless communication systems, particularly in dynamic scenarios, by using visual data to improve beamforming accuracy in mmWave MIMO systems.
Contribution
It introduces a novel framework combining DL-based CV with wireless communication, demonstrating improved beamforming accuracy using visual data in mobile scenarios.
Findings
Visual data significantly improves MIMO beamforming performance.
The proposed framework outperforms baseline methods in accuracy.
DL-based CV offers new opportunities for wireless communication enhancement.
Abstract
Deep learning (DL) has seen great success in the computer vision (CV) field, and related techniques have been used in security, healthcare, remote sensing, and many other fields. As a parallel development, visual data has become universal in daily life, easily generated by ubiquitous low-cost cameras. Therefore, exploring DL-based CV may yield useful information about objects, such as their number, locations, distribution, motion, etc. Intuitively, DL-based CV can also facilitate and improve the designs of wireless communications, especially in dynamic network scenarios. However, so far, such work is rare in the literature. The primary purpose of this article, then, is to introduce ideas about applying DL-based CV in wireless communications to bring some novel degrees of freedom to both theoretical research and engineering applications. To illustrate how DL-based CV can be applied in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMillimeter-Wave Propagation and Modeling · Speech and Audio Processing · Video Surveillance and Tracking Methods
Methods1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Bottleneck Residual Block · Batch Normalization · Average Pooling · Max Pooling · Global Average Pooling · Residual Connection · Kaiming Initialization · Convolution
