Disentangled Clothed Avatar Generation with Layered Representation
Weitian Zhang, Yichao Yan, Sijing Wu, Manwen Liao, Xiaokang Yang

TL;DR
LayerAvatar is a novel diffusion-based method that generates high-resolution, disentangled clothed avatars with controllable features using a layered UV representation, enabling real-time rendering and animation.
Contribution
It introduces a layered UV feature plane representation and a single-stage diffusion model for component-disentangled avatar generation, addressing occlusion issues.
Findings
High-quality, disentangled avatar generation demonstrated
Supports real-time rendering and expressive animation
Effective component transfer capabilities
Abstract
Clothed avatar generation has wide applications in virtual and augmented reality, filmmaking, and more. Previous methods have achieved success in generating diverse digital avatars, however, generating avatars with disentangled components (\eg, body, hair, and clothes) has long been a challenge. In this paper, we propose LayerAvatar, the first feed-forward diffusion-based method for generating component-disentangled clothed avatars. To achieve this, we first propose a layered UV feature plane representation, where components are distributed in different layers of the Gaussian-based UV feature plane with corresponding semantic labels. This representation supports high-resolution and real-time rendering, as well as expressive animation including controllable gestures and facial expressions. Based on the well-designed representation, we train a single-stage diffusion model and introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Human Motion and Animation
MethodsDiffusion
