Scale What Counts, Mask What Matters: Evaluating Foundation Models for Zero-Shot Cross-Domain Wi-Fi Sensing

Cheng Jiang; Yihe Yan; Yanxiang Wang; Chun Tung Chou; Wen Hu

arXiv:2511.18792·cs.CV·November 25, 2025

Scale What Counts, Mask What Matters: Evaluating Foundation Models for Zero-Shot Cross-Domain Wi-Fi Sensing

Cheng Jiang, Yihe Yan, Yanxiang Wang, Chun Tung Chou, Wen Hu

PDF

Open Access

TL;DR

This paper demonstrates that large-scale pretraining on diverse Wi-Fi CSI datasets significantly enhances zero-shot cross-domain Wi-Fi sensing performance, highlighting data scale over model size as crucial for robustness.

Contribution

It introduces a foundation model approach with Masked Autoencoding pretraining on the largest Wi-Fi CSI datasets, systematically analyzing data diversity versus model capacity impacts.

Findings

01

Scaling data improves unseen domain performance log-linearly.

02

Larger models offer marginal gains given current data volume.

03

Pretraining enhances cross-domain accuracy by 2.2% to 15.7%.

Abstract

While Wi-Fi sensing offers a compelling, privacy-preserving alternative to cameras, its practical utility has been fundamentally undermined by a lack of robustness across domains. Models trained in one setup fail to generalize to new environments, hardware, or users, a critical "domain shift" problem exacerbated by modest, fragmented public datasets. We shift from this limited paradigm and apply a foundation model approach, leveraging Masked Autoencoding (MAE) style pretraining on the largest and most heterogeneous Wi-Fi CSI datasets collection assembled to date. Our study pretrains and evaluates models on over 1.3 million samples extracted from 14 datasets, collected using 4 distinct devices across the 2.4/5/6 GHz bands and bandwidths from 20 to 160 MHz. Our large-scale evaluation is the first to systematically disentangle the impacts of data diversity versus model capacity on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndoor and Outdoor Localization Technologies · Wireless Networks and Protocols · Speech and Audio Processing