Scale What Counts, Mask What Matters: Evaluating Foundation Models for Zero-Shot Cross-Domain Wi-Fi Sensing
Cheng Jiang, Yihe Yan, Yanxiang Wang, Chun Tung Chou, Wen Hu

TL;DR
This paper demonstrates that large-scale pretraining on diverse Wi-Fi CSI datasets significantly enhances zero-shot cross-domain Wi-Fi sensing performance, highlighting data scale over model size as crucial for robustness.
Contribution
It introduces a foundation model approach with Masked Autoencoding pretraining on the largest Wi-Fi CSI datasets, systematically analyzing data diversity versus model capacity impacts.
Findings
Scaling data improves unseen domain performance log-linearly.
Larger models offer marginal gains given current data volume.
Pretraining enhances cross-domain accuracy by 2.2% to 15.7%.
Abstract
While Wi-Fi sensing offers a compelling, privacy-preserving alternative to cameras, its practical utility has been fundamentally undermined by a lack of robustness across domains. Models trained in one setup fail to generalize to new environments, hardware, or users, a critical "domain shift" problem exacerbated by modest, fragmented public datasets. We shift from this limited paradigm and apply a foundation model approach, leveraging Masked Autoencoding (MAE) style pretraining on the largest and most heterogeneous Wi-Fi CSI datasets collection assembled to date. Our study pretrains and evaluates models on over 1.3 million samples extracted from 14 datasets, collected using 4 distinct devices across the 2.4/5/6 GHz bands and bandwidths from 20 to 160 MHz. Our large-scale evaluation is the first to systematically disentangle the impacts of data diversity versus model capacity on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndoor and Outdoor Localization Technologies · Wireless Networks and Protocols · Speech and Audio Processing
