Latent Dynamics for Full Body Avatar Animation
Shichong Peng, Chengxiang Yin, Fei Jiang, Zhongshi Jiang, Lingchen Yang, Qingyang Tan, Amin Jourabloo, Jason Saragih, Ke Li, Christian H\"ane

TL;DR
This paper introduces a novel neural avatar animation method that models temporal clothing dynamics using a transformer-based residual latent, improving realism and diversity in full-body avatar motion synthesis.
Contribution
It proposes a new approach combining a pose-conditioned 3D Gaussian avatar with a dynamics residual latent and a transformer decoder to better capture temporal clothing variations.
Findings
Improved animation quality over recent baselines.
Diverse plausible motion trajectories generated from different initial conditions.
Temporal coherence achieved with negligible runtime cost.
Abstract
Pose-driven full-body avatars built on neural rendering produce high-quality novel views of a captured subject. Yet loose clothing and other dynamic elements deform in ways pose alone cannot explain: the same pose can correspond to many different states, because their motion depends on history, inertia, and contact. Explicit simulation and layered-garment methods can model such dynamics, but they require either a dedicated garment template, which raw multi-view capture does not naturally provide, or a test-time physics simulator with non-trivial runtime cost. A parallel line of work learns data-driven clothing avatars that avoid explicit garment layers. These methods add an auxiliary latent for variation beyond pose; at inference, they fix it, regress it from pose, or retrieve it from training data, without explicitly modeling how the latent evolves with its own dynamics. Additionally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
