Foundation Model Empowered Synesthesia of Machines (SoM): AI-native Intelligent Multi-Modal Sensing-Communication Integration
Xiang Cheng, Boxun Liu, Xuanyu Liu, Ensong Liu, Ziwei Huang

TL;DR
This paper introduces a systematic framework for integrating foundation models into AI-native multi-modal sensing and communication systems for future 6G networks, addressing current limitations of task-specific models.
Contribution
It categorizes foundation models for SoM, proposes two design roadmaps, and provides case studies demonstrating their advantages over traditional task-specific AI models.
Findings
FMs significantly outperform task-specific models in SoM tasks.
Proposed schemes enable more universal and generalizable SoM system design.
Framework facilitates systematic development of AI-native 6G sensing-communication systems.
Abstract
To support future intelligent multifunctional sixth-generation (6G) wireless communication networks, Synesthesia of Machines (SoM) is proposed as a novel paradigm for artificial intelligence (AI)-native intelligent multi-modal sensing-communication integration. However, existing SoM system designs rely on task-specific AI models and face challenges such as scarcity of massive high-quality datasets, constrained modeling capability, poor generalization, and limited universality. Recently, foundation models (FMs) have emerged as a new deep learning paradigm and have been preliminarily applied to SoM-related tasks, but a systematic design framework is still lacking. In this paper, we for the first time present a systematic categorization of FMs for SoM system design, dividing them into general-purpose FMs, specifically large language models (LLMs), and SoM domain-specific FMs, referred to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Wireless Signal Modulation Classification · Indoor and Outdoor Localization Technologies
MethodsSelf-Organizing Map
