DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models
Zhenyu Xie, Haoye Dong, Yufei Gao, Zehua Ma, Xiaodan Liang

TL;DR
DreamVTON is a novel 3D virtual try-on framework that separately optimizes geometry and texture using personalized diffusion models and template-based mechanisms, enabling realistic multi-view 3D human generation from images.
Contribution
It introduces a new method combining personalized diffusion models with template-based optimization for improved 3D human try-on.
Findings
Effective separation of geometry and texture optimization.
Enhanced multi-view consistency in 3D human models.
Improved realism in 3D virtual try-on results.
Abstract
Image-based 3D Virtual Try-ON (VTON) aims to sculpt the 3D human according to person and clothes images, which is data-efficient (i.e., getting rid of expensive 3D data) but challenging. Recent text-to-3D methods achieve remarkable improvement in high-fidelity 3D human generation, demonstrating its potential for 3D virtual try-on. Inspired by the impressive success of personalized diffusion models (e.g., Dreambooth and LoRA) for 2D VTON, it is straightforward to achieve 3D VTON by integrating the personalization technique into the diffusion-based text-to-3D framework. However, employing the personalized module in a pre-trained diffusion model (e.g., StableDiffusion (SD)) would degrade the model's capability for multi-view or multi-domain synthesis, which is detrimental to the geometry and texture optimization guided by Score Distillation Sampling (SDS) loss. In this work, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis
MethodsDiffusion
