DreamDPO: Aligning Text-to-3D Generation with Human Preferences via   Direct Preference Optimization

Zhenglin Zhou; Xiaobo Xia; Fan Ma; Hehe Fan; Yi Yang; Tat-Seng Chua

arXiv:2502.04370·cs.CL·February 10, 2025

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

Zhenglin Zhou, Xiaobo Xia, Fan Ma, Hehe Fan, Yi Yang, Tat-Seng Chua

PDF

Open Access 1 Video

TL;DR

DreamDPO introduces a novel optimization framework that aligns text-to-3D content with human preferences using pairwise comparisons and preference-driven loss, resulting in more controllable and higher-quality 3D generation.

Contribution

It presents the first preference optimization approach for text-to-3D generation, improving alignment with human preferences and controllability over existing methods.

Findings

01

Achieves higher-quality 3D content than existing methods

02

Enables fine-grained control through preference-guided optimization

03

Reduces reliance on pointwise quality evaluations

Abstract

Text-to-3D generation automates 3D content creation from textual descriptions, which offers transformative potential across various fields. However, existing methods often struggle to align generated content with human preferences, limiting their applicability and flexibility. To address these limitations, in this paper, we propose DreamDPO, an optimization-based framework that integrates human preferences into the 3D generation process, through direct preference optimization. Practically, DreamDPO first constructs pairwise examples, then compare their alignment with human preferences using reward or large multimodal models, and lastly optimizes the 3D representation with a preference-driven loss function. By leveraging pairwise comparison to reflect preferences, DreamDPO reduces reliance on precise pointwise quality evaluations while enabling fine-grained controllability through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization· slideslive

Taxonomy

TopicsHuman Motion and Animation · Image Processing and 3D Reconstruction · 3D Shape Modeling and Analysis

MethodsALIGN