MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences
Weitao Wang, Haoran Xu, Yuxiao Yang, Zhifang Liu, Jun Meng, Haoqian, Wang

TL;DR
This paper introduces MVReward, a reward model trained on human preferences to evaluate and improve multi-view diffusion models for 3D content generation, addressing evaluation fairness and alignment issues.
Contribution
The paper presents a new human-aligned reward model and a multi-view preference learning strategy to improve evaluation and tuning of multi-view diffusion models.
Findings
MVReward effectively encodes human preferences for multi-view assets.
The proposed MVP method improves alignment of diffusion models with human judgments.
Extensive experiments validate the reliability of MVReward as an evaluation metric.
Abstract
Recent years have witnessed remarkable progress in 3D content generation. However, corresponding evaluation methods struggle to keep pace. Automatic approaches have proven challenging to align with human preferences, and the mixed comparison of text- and image-driven methods often leads to unfair evaluations. In this paper, we present a comprehensive framework to better align and evaluate multi-view diffusion models with human preferences. To begin with, we first collect and filter a standardized image prompt set from DALLE and Objaverse, which we then use to generate multi-view assets with several multi-view diffusion models. Through a systematic ranking pipeline on these assets, we obtain a human annotation dataset with 16k expert pairwise comparisons and train a reward model, coined MVReward, to effectively encode human preferences. With MVReward, image-driven 3D methods can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications
MethodsSparse Evolutionary Training · ALIGN · Diffusion
