Large Model for Small Data: Foundation Model for Cross-Modal RF Human Activity Recognition
Yuxuan Weng, Guoquan Wu, Tianyue Zheng, Yanbing Yang, and Jun Luo

TL;DR
This paper introduces FM-Fi, a cross-modal framework that leverages vision-based foundation models to improve RF-based human activity recognition, especially in small data scenarios, through knowledge distillation and few-shot learning.
Contribution
The paper presents a novel cross-modal contrastive knowledge distillation method and a framework that adapts vision-based foundation models for RF HAR, enabling zero-shot and few-shot learning.
Findings
FM-Fi achieves comparable performance to vision-based methods.
It demonstrates strong generalizability across different environments.
The framework effectively enhances RF HAR with limited data.
Abstract
Radio-Frequency (RF)-based Human Activity Recognition (HAR) rises as a promising solution for applications unamenable to techniques requiring computer visions. However, the scarcity of labeled RF data due to their non-interpretable nature poses a significant obstacle. Thanks to the recent breakthrough of foundation models (FMs), extracting deep semantic insights from unlabeled visual data become viable, yet these vision-based FMs fall short when applied to small RF datasets. To bridge this gap, we introduce FM-Fi, an innovative cross-modal framework engineered to translate the knowledge of vision-based FMs for enhancing RF-based HAR systems. FM-Fi involves a novel cross-modal contrastive knowledge distillation mechanism, enabling an RF encoder to inherit the interpretative power of FMs for achieving zero-shot learning. It also employs the intrinsic capabilities of FM and RF to remove…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Context-Aware Activity Recognition Systems · Advanced Computing and Algorithms
MethodsKnowledge Distillation
