PuzzleAvatar: Assembling 3D Avatars from Personal Albums
Yuliang Xiu, Yufei Ye, Zhen Liu, Dimitrios Tzionas, Michael J. Black

TL;DR
PuzzleAvatar is a novel method that creates accurate 3D avatars from casual personal photo albums by fine-tuning vision-language models, bypassing pose estimation challenges, and enabling easy customization.
Contribution
The paper introduces PuzzleAvatar, a new approach that generates faithful 3D avatars from personal photo collections without requiring full-body images or pose estimation, and provides a new dataset for this task.
Findings
Outperforms existing methods like TeCH and MVDreamBooth in reconstruction accuracy.
Demonstrates high scalability and robustness to diverse casual photos.
Enables avatar customization by swapping learned tokens.
Abstract
Generating personalized 3D avatars is crucial for AR/VR. However, recent text-to-3D methods that generate avatars for celebrities or fictional characters, struggle with everyday people. Methods for faithful reconstruction typically require full-body images in controlled settings. What if a user could just upload their personal "OOTD" (Outfit Of The Day) photo collection and get a faithful avatar in return? The challenge is that such casual photo collections contain diverse poses, challenging viewpoints, cropped views, and occlusion (albeit with a consistent outfit, accessories and hairstyle). We address this novel "Album2Human" task by developing PuzzleAvatar, a novel model that generates a faithful 3D avatar (in a canonical pose) from a personal OOTD album, while bypassing the challenging estimation of body and camera pose. To this end, we fine-tune a foundational vision-language model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Augmented Reality Applications · Artificial Intelligence in Games
