Garment Inertial Denoiser (GID): Endowing Accurate Motion Capture via Loose IMU Denoiser

Jiawei Fang; Ruonan Zheng; Xiaoxia Gao; Shifan Jiang; Anjun Chen; Qi Ye; Shihui Guo

arXiv:2601.01360·cs.CV·January 6, 2026

Garment Inertial Denoiser (GID): Endowing Accurate Motion Capture via Loose IMU Denoiser

Jiawei Fang, Ruonan Zheng, Xiaoxia Gao, Shifan Jiang, Anjun Chen, Qi Ye, Shihui Guo

PDF

Open Access

TL;DR

GID is a Transformer-based denoiser that improves the accuracy of loose-fitting garment inertial motion capture by addressing sensor displacement issues, enabling real-time, generalized motion estimation.

Contribution

The paper introduces GID, a novel Transformer architecture with location-specific denoising and cross-wear fusion for loose garment IMU-based motion capture, and a new dataset GarMoCap.

Findings

01

GID significantly improves motion capture accuracy over state-of-the-art methods.

02

GID generalizes well across users, motions, and garments.

03

Real-time denoising is achieved with limited training data.

Abstract

Wearable inertial motion capture (MoCap) provides a portable, occlusion-free, and privacy-preserving alternative to camera-based systems, but its accuracy depends on tightly attached sensors - an intrusive and uncomfortable requirement for daily use. Embedding IMUs into loose-fitting garments is a desirable alternative, yet sensor-body displacement introduces severe, structured, and location-dependent corruption that breaks standard inertial pipelines. We propose GID (Garment Inertial Denoiser), a lightweight, plug-and-play Transformer that factorizes loose-wear MoCap into three stages: (i) location-specific denoising, (ii) adaptive cross-wear fusion, and (iii) general pose prediction. GID uses a location-aware expert architecture, where a shared spatio-temporal backbone models global motion while per-IMU expert heads specialize in local garment dynamics, and a lightweight fusion module…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Human Motion and Animation · Human Pose and Action Recognition