CSI-JEPA: Towards Foundation Representations for Ubiquitous Sensing with Minimal Supervision
Xuanhao Luo, Zhizhen Li, Yuchen Liu

TL;DR
CSI-JEPA introduces a self-supervised learning framework for Wi-Fi sensing that learns reusable representations from unlabeled CSI data, significantly reducing the need for labeled data and improving multi-task sensing performance.
Contribution
It proposes a novel self-supervised predictive learning method tailored for CSI data, with a channel variation-aware masking strategy and multi-task adaptation, enhancing label efficiency and generalization.
Findings
Achieves up to 10.64% accuracy improvement over supervised models.
Reduces labeled data requirements by up to 98%.
Improves performance across seven diverse Wi-Fi sensing tasks.
Abstract
Channel state information (CSI) provides a widely available sensing modality for human and environment perception, but existing CSI sensing models usually rely on task-specific supervised training and require substantial labeled data for each task, device, user, or environment. This limits their scalability in practical deployments where unlabeled CSI is abundant but labeled data is costly to collect. In this paper, we present CSI-JEPA, a self-supervised predictive representation learning framework for label-efficient, multi-task Wi-Fi sensing. CSI-JEPA learns reusable temporal-spectral representations from unlabeled CSI samples by predicting latent features of masked channel regions from visible context. To better match the physical structure of CSI, CSI-JEPA tokenizes channel-response amplitude windows along the time and subcarrier dimensions. It then introduces a channel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
