DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
Zhenglin Zhou, Xiaobo Xia, Fan Ma, Hehe Fan, Yi Yang, Tat-Seng Chua

TL;DR
DreamDPO introduces a novel optimization framework that aligns text-to-3D content with human preferences using pairwise comparisons and preference-driven loss, resulting in more controllable and higher-quality 3D generation.
Contribution
It presents the first preference optimization approach for text-to-3D generation, improving alignment with human preferences and controllability over existing methods.
Findings
Achieves higher-quality 3D content than existing methods
Enables fine-grained control through preference-guided optimization
Reduces reliance on pointwise quality evaluations
Abstract
Text-to-3D generation automates 3D content creation from textual descriptions, which offers transformative potential across various fields. However, existing methods often struggle to align generated content with human preferences, limiting their applicability and flexibility. To address these limitations, in this paper, we propose DreamDPO, an optimization-based framework that integrates human preferences into the 3D generation process, through direct preference optimization. Practically, DreamDPO first constructs pairwise examples, then compare their alignment with human preferences using reward or large multimodal models, and lastly optimizes the 3D representation with a preference-driven loss function. By leveraging pairwise comparison to reflect preferences, DreamDPO reduces reliance on precise pointwise quality evaluations while enabling fine-grained controllability through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Motion and Animation · Image Processing and 3D Reconstruction · 3D Shape Modeling and Analysis
MethodsALIGN
