MVReward: Better Aligning and Evaluating Multi-View Diffusion Models   with Human Preferences

Weitao Wang; Haoran Xu; Yuxiao Yang; Zhifang Liu; Jun Meng; Haoqian; Wang

arXiv:2412.06614·cs.CV·December 10, 2024

MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences

Weitao Wang, Haoran Xu, Yuxiao Yang, Zhifang Liu, Jun Meng, Haoqian, Wang

PDF

Open Access

TL;DR

This paper introduces MVReward, a reward model trained on human preferences to evaluate and improve multi-view diffusion models for 3D content generation, addressing evaluation fairness and alignment issues.

Contribution

The paper presents a new human-aligned reward model and a multi-view preference learning strategy to improve evaluation and tuning of multi-view diffusion models.

Findings

01

MVReward effectively encodes human preferences for multi-view assets.

02

The proposed MVP method improves alignment of diffusion models with human judgments.

03

Extensive experiments validate the reliability of MVReward as an evaluation metric.

Abstract

Recent years have witnessed remarkable progress in 3D content generation. However, corresponding evaluation methods struggle to keep pace. Automatic approaches have proven challenging to align with human preferences, and the mixed comparison of text- and image-driven methods often leads to unfair evaluations. In this paper, we present a comprehensive framework to better align and evaluate multi-view diffusion models with human preferences. To begin with, we first collect and filter a standardized image prompt set from DALL $\cdot$ E and Objaverse, which we then use to generate multi-view assets with several multi-view diffusion models. Through a systematic ranking pipeline on these assets, we obtain a human annotation dataset with 16k expert pairwise comparisons and train a reward model, coined MVReward, to effectively encode human preferences. With MVReward, image-driven 3D methods can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications

MethodsSparse Evolutionary Training · ALIGN · Diffusion