Debiasing Scores and Prompts of 2D Diffusion for View-consistent   Text-to-3D Generation

Susung Hong; Donghoon Ahn; Seungryong Kim

arXiv:2303.15413·cs.CV·December 21, 2023·5 cites

Debiasing Scores and Prompts of 2D Diffusion for View-consistent Text-to-3D Generation

Susung Hong, Donghoon Ahn, Seungryong Kim

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper addresses view inconsistency in text-to-3D generation by identifying biases in 2D diffusion models and proposing two debiasing methods, resulting in more realistic and view-consistent 3D objects.

Contribution

The paper introduces two novel debiasing approaches—score debiasing and prompt debiasing—to improve view consistency in text-to-3D generation.

Findings

01

Enhanced realism of 3D objects with reduced artifacts

02

Improved view consistency in generated 3D models

03

Achieved better balance between 2D fidelity and 3D coherence

Abstract

Existing score-distilling text-to-3D generation techniques, despite their considerable promise, often encounter the view inconsistency problem. One of the most notable issues is the Janus problem, where the most canonical view of an object (\textit{e.g}., face or head) appears in other views. In this work, we explore existing frameworks for score-distilling text-to-3D generation and identify the main causes of the view inconsistency problem -- the embedded bias of 2D diffusion models. Based on these findings, we propose two approaches to debias the score-distillation frameworks for view-consistent text-to-3D generation. Our first approach, called score debiasing, involves cutting off the score estimated by 2D diffusion models and gradually increasing the truncation value throughout the optimization process. Our second approach, called prompt debiasing, identifies conflicting words…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

threestudio-project/threestudio
jaxOfficial

Videos

Debiasing Scores and Prompts of 2D Diffusion for View-consistent Text-to-3D Generation· slideslive

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Advanced Image and Video Retrieval Techniques

MethodsDiffusion