DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image   Diffusion for 3D Generative Model

Gwanghyun Kim; Se Young Chun

arXiv:2211.16374·cs.CV·April 3, 2023·1 cites

DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model

Gwanghyun Kim, Se Young Chun

PDF

Open Access

TL;DR

DATID-3D introduces a novel text-to-image diffusion-based domain adaptation method for 3D generative models, preserving diversity and improving multi-view consistency without extra data, surpassing existing methods.

Contribution

It presents a new pipeline that fine-tunes 3D generators using diffusion models, enhancing diversity and text-image alignment in domain adaptation tasks.

Findings

01

Outperforms existing methods in diversity and text-image correspondence

02

Enables high-resolution, multi-view consistent 3D image synthesis

03

Supports diverse 3D image manipulations like one-shot adaptation

Abstract

Recent 3D generative models have achieved remarkable performance in synthesizing high resolution photorealistic images with view consistency and detailed 3D shapes, but training them for diverse domains is challenging since it requires massive training images and their camera distribution information. Text-guided domain adaptation methods have shown impressive performance on converting the 2D generative model on one domain into the models on other domains with different styles by leveraging the CLIP (Contrastive Language-Image Pre-training), rather than collecting massive datasets for those domains. However, one drawback of them is that the sample diversity in the original generative model is not well-preserved in the domain-adapted generative models due to the deterministic nature of the CLIP text encoder. Text-guided domain adaptation will be even more challenging for 3D generative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning

MethodsContrastive Language-Image Pre-training · Diffusion