maxVSTAR: Maximally Adaptive Vision-Guided CSI Sensing with Closed-Loop Edge Model Adaptation for Robust Human Activity Recognition
Kexing Liu

TL;DR
maxVSTAR introduces a vision-guided, closed-loop framework that enables real-time, autonomous adaptation of CSI-based human activity recognition models on edge devices, significantly improving robustness under environmental changes.
Contribution
The paper presents maxVSTAR, a novel vision-guided, self-supervised model adaptation framework for CSI-based HAR, enabling continuous online fine-tuning without manual intervention.
Findings
Restored recognition accuracy from 49.14% to 81.51% after adaptation.
Demonstrated effective domain shift mitigation in uncalibrated hardware.
Validated scalability and practicality for long-term autonomous HAR.
Abstract
WiFi Channel State Information (CSI)-based human activity recognition (HAR) provides a privacy-preserving, device-free sensing solution for smart environments. However, its deployment on edge devices is severely constrained by domain shift, where recognition performance deteriorates under varying environmental and hardware conditions. This study presents maxVSTAR (maximally adaptive Vision-guided Sensing Technology for Activity Recognition), a closed-loop, vision-guided model adaptation framework that autonomously mitigates domain shift for edge-deployed CSI sensing systems. The proposed system integrates a cross-modal teacher-student architecture, where a high-accuracy YOLO-based vision model serves as a dynamic supervisory signal, delivering real-time activity labels for the CSI data stream. These labels enable autonomous, online fine-tuning of a lightweight CSI-based HAR model,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
