RF-Analyzer: Can Vision-Language Models Learn RF Understanding from Synthetic Data?
Anis Bara, Lina Bariah, Hang Zou, Brahim Mefgouda, Merouane Debbah

TL;DR
This paper explores whether vision-language models trained solely on synthetic spectrogram data can effectively interpret real RF signals, demonstrating promising generalization but with notable limitations in low-SNR conditions.
Contribution
It introduces RF-Analyzer, a platform for evaluating VLMs on live RF signals, and establishes a benchmark showing synthetic data can enable transferable RF understanding.
Findings
VLMs trained on synthetic data can extract physical RF attributes from real signals.
Synthetic training enables some generalization but struggles with low-SNR and unseen conditions.
The benchmark framework quantifies signal understanding and grounding in RF interpretation.
Abstract
Understanding the wireless spectrum is a fundamen- tal requirement for intelligent communication systems, however, interpreting spectrograms requires extracting multiple physical attributes and reasoning about signal structure, which is a capability that is not achieved by traditional ML approaches. Recent advances in vision-language models (VLMs) demonstrated the possibility of learning such interpretation capabilities directly from data. This paper investigates whether VLMs can learn this capability from synthetic data alone, and more importantly, whether such learned representations generalize to real over-the- air RF environments. To address this question, we introduce RF-Analyzer, an SDR-to-AI analysis platform that integrates live spectrum captures associated with the corresponding VLM- based interpretation, enabling direct evaluation of VLMs outputs on live over-the-air signals.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
