Breaking Coordinate Overfitting: Geometry-Aware WiFi Sensing for Cross-Layout 3D Pose Estimation

Songming Jia; Yan Lu; Bin Liu; Xiang Zhang; Peng Zhao; Xinmeng Tang; Yelin Wei; Jinyang Huang; Huan Yan; Zhi Liu

arXiv:2601.12252·cs.HC·January 21, 2026

Breaking Coordinate Overfitting: Geometry-Aware WiFi Sensing for Cross-Layout 3D Pose Estimation

Songming Jia, Yan Lu, Bin Liu, Xiang Zhang, Peng Zhao, Xinmeng Tang, Yelin Wei, Jinyang Huang, Huan Yan, Zhi Liu

PDF

Open Access

TL;DR

This paper introduces PerceptAlign, a geometry-aware framework for WiFi-based 3D human pose estimation that overcomes coordinate overfitting and generalizes across different device layouts by aligning WiFi and vision data in a shared 3D space.

Contribution

PerceptAlign is the first geometry-conditioned approach for WiFi pose estimation, enabling layout-invariant predictions through coordinate unification and device geometry encoding.

Findings

01

Reduces in-domain error by 12.3%

02

Decreases cross-domain error by over 60%

03

Constructs the largest cross-domain WiFi pose dataset to date

Abstract

WiFi-based 3D human pose estimation offers a low-cost and privacy-preserving alternative to vision-based systems for smart interaction. However, existing approaches rely on visual 3D poses as supervision and directly regress CSI to a camera-based coordinate system. We find that this practice leads to coordinate overfitting: models memorize deployment-specific WiFi transceiver layouts rather than only learning activity-relevant representations, resulting in severe generalization failures. To address this challenge, we present PerceptAlign, the first geometry-conditioned framework for WiFi-based cross-layout pose estimation. PerceptAlign introduces a lightweight coordinate unification procedure that aligns WiFi and vision measurements in a shared 3D space using only two checkerboards and a few photos. Within this unified space, it encodes calibrated transceiver positions into…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndoor and Outdoor Localization Technologies · Context-Aware Activity Recognition Systems · Human Pose and Action Recognition