MAGE:A Multi-stage Avatar Generator with Sparse Observations

Fangyu Du; Yang Yang; Xuehao Gao; Hongye Hou

arXiv:2505.06411·cs.CV·May 13, 2025

MAGE:A Multi-stage Avatar Generator with Sparse Observations

Fangyu Du, Yang Yang, Xuehao Gao, Hongye Hou

PDF

Open Access

TL;DR

MAGE is a multi-stage avatar generator that progressively infers full-body poses from sparse head-mounted device observations, improving motion realism and temporal consistency in AR/VR applications.

Contribution

It introduces a multi-stage, progressive prediction strategy that factorizes the motion mapping process, enhancing accuracy and realism over previous one-stage methods.

Findings

01

Outperforms state-of-the-art methods in accuracy

02

Produces more realistic and temporally consistent motions

03

Effective in large-scale dataset evaluations

Abstract

Inferring full-body poses from Head Mounted Devices, which capture only 3-joint observations from the head and wrists, is a challenging task with wide AR/VR applications. Previous attempts focus on learning one-stage motion mapping and thus suffer from an over-large inference space for unobserved body joint motions. This often leads to unsatisfactory lower-body predictions and poor temporal consistency, resulting in unrealistic or incoherent motion sequences. To address this, we propose a powerful Multi-stage Avatar GEnerator named MAGE that factorizes this one-stage direct motion mapping learning with a progressive prediction strategy. Specifically, given initial 3-joint motions, MAGE gradually inferring multi-scale body part poses at different abstract granularity levels, starting from a 6-part body representation and gradually refining to 22 joints. With decreasing abstract levels…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Shape Modeling and Analysis

MethodsFocus