PReD: An LLM-based Foundation Multimodal Model for Electromagnetic Perception, Recognition, and Decision
Zehua Han, Jing Xiao, Yiqi Duan, Mengyu Xiang, Yuheng Ji, Xiaolong Zheng, Chenghanyu Zhang, Zhendong She, Junyu Shen, Dingwei Tan, Shichu Sun, Zhou Cong, Mingxuan Liu, Fengxiang Wang, Jinping Sun, Yangang Sun

TL;DR
PReD is a pioneering multimodal foundation model for electromagnetic signals, integrating perception, recognition, and decision-making to enhance EM domain understanding and reasoning capabilities.
Contribution
It introduces the first EM-focused foundation model with a large multitask dataset and benchmark, enabling end-to-end EM signal understanding and reasoning.
Findings
Achieves state-of-the-art performance on EM signal tasks.
Demonstrates effective multi-task training for EM signals.
Validates vision-aligned foundation models in the EM domain.
Abstract
Multimodal Large Language Models have demonstrated powerful cross-modal understanding and reasoning capabilities in general domains. However, in the electromagnetic (EM) domain, they still face challenges such as data scarcity and insufficient integration of domain knowledge. This paper proposes PReD, the first foundation model for the EM domain that covers the intelligent closed-loop of "perception, recognition, decision-making." We constructed a high-quality multitask EM dataset, PReD-1.3M, and an evaluation benchmark, PReD-Bench. The dataset encompasses multi-perspective representations such as raw time-domain waveform, frequency-domain spectrograms, and constellation diagrams, covering typical features of communication and radar signals. It supports a range of core tasks, including signal detection, modulation recognition, parameter estimation, protocol recognition, radio frequency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
