Connecting Consistency Distillation to Score Distillation for Text-to-3D   Generation

Zongrui Li; Minghui Hu; Qian Zheng; Xudong Jiang

arXiv:2407.13584·cs.CV·July 23, 2024

Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation

Zongrui Li, Minghui Hu, Qian Zheng, Xudong Jiang

PDF

Open Access 1 Repo

TL;DR

This paper analyzes score distillation in text-to-3D generation, introduces Guided Consistency Sampling and Brightness-Equalized Generation to improve detail and fidelity, and demonstrates superior results over state-of-the-art methods.

Contribution

It connects consistency distillation theory to score distillation, proposing GCS and BEG to enhance 3D generation quality.

Findings

01

GCS improves detail and fidelity in 3D assets.

02

BEG mitigates brightness oversaturation in rendering.

03

Proposed methods outperform existing state-of-the-art techniques.

Abstract

Although recent advancements in text-to-3D generation have significantly improved generation quality, issues like limited level of detail and low fidelity still persist, which requires further improvement. To understand the essence of those issues, we thoroughly analyze current score distillation methods by connecting theories of consistency distillation to score distillation. Based on the insights acquired through analysis, we propose an optimization framework, Guided Consistency Sampling (GCS), integrated with 3D Gaussian Splatting (3DGS) to alleviate those issues. Additionally, we have observed the persistent oversaturation in the rendered views of generated 3D assets. From experiments, we find that it is caused by unwanted accumulated brightness in 3DGS during optimization. To mitigate this issue, we introduce a Brightness-Equalized Generation (BEG) scheme in 3DGS rendering.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LMozart/ECCV2024-GCS-BEG
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Handwritten Text Recognition Techniques · Image Processing and 3D Reconstruction