DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model
Gwanghyun Kim, Se Young Chun

TL;DR
DATID-3D introduces a novel text-to-image diffusion-based domain adaptation method for 3D generative models, preserving diversity and improving multi-view consistency without extra data, surpassing existing methods.
Contribution
It presents a new pipeline that fine-tunes 3D generators using diffusion models, enhancing diversity and text-image alignment in domain adaptation tasks.
Findings
Outperforms existing methods in diversity and text-image correspondence
Enables high-resolution, multi-view consistent 3D image synthesis
Supports diverse 3D image manipulations like one-shot adaptation
Abstract
Recent 3D generative models have achieved remarkable performance in synthesizing high resolution photorealistic images with view consistency and detailed 3D shapes, but training them for diverse domains is challenging since it requires massive training images and their camera distribution information. Text-guided domain adaptation methods have shown impressive performance on converting the 2D generative model on one domain into the models on other domains with different styles by leveraging the CLIP (Contrastive Language-Image Pre-training), rather than collecting massive datasets for those domains. However, one drawback of them is that the sample diversity in the original generative model is not well-preserved in the domain-adapted generative models due to the deterministic nature of the CLIP text encoder. Text-guided domain adaptation will be even more challenging for 3D generative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
MethodsContrastive Language-Image Pre-training · Diffusion
