Breaking Coordinate Overfitting: Geometry-Aware WiFi Sensing for Cross-Layout 3D Pose Estimation
Songming Jia, Yan Lu, Bin Liu, Xiang Zhang, Peng Zhao, Xinmeng Tang, Yelin Wei, Jinyang Huang, Huan Yan, Zhi Liu

TL;DR
This paper introduces PerceptAlign, a geometry-aware framework for WiFi-based 3D human pose estimation that overcomes coordinate overfitting and generalizes across different device layouts by aligning WiFi and vision data in a shared 3D space.
Contribution
PerceptAlign is the first geometry-conditioned approach for WiFi pose estimation, enabling layout-invariant predictions through coordinate unification and device geometry encoding.
Findings
Reduces in-domain error by 12.3%
Decreases cross-domain error by over 60%
Constructs the largest cross-domain WiFi pose dataset to date
Abstract
WiFi-based 3D human pose estimation offers a low-cost and privacy-preserving alternative to vision-based systems for smart interaction. However, existing approaches rely on visual 3D poses as supervision and directly regress CSI to a camera-based coordinate system. We find that this practice leads to coordinate overfitting: models memorize deployment-specific WiFi transceiver layouts rather than only learning activity-relevant representations, resulting in severe generalization failures. To address this challenge, we present PerceptAlign, the first geometry-conditioned framework for WiFi-based cross-layout pose estimation. PerceptAlign introduces a lightweight coordinate unification procedure that aligns WiFi and vision measurements in a shared 3D space using only two checkerboards and a few photos. Within this unified space, it encodes calibrated transceiver positions into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndoor and Outdoor Localization Technologies · Context-Aware Activity Recognition Systems · Human Pose and Action Recognition
