Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation
Yujie Zhang, Bingyang Cui, Qi Yang, Zhu Li, Yiling Xu

TL;DR
This paper introduces MATE-3D, a comprehensive benchmark for evaluating text-to-3D generation across multiple dimensions, and proposes HyperScore, a hypernetwork-based metric that effectively assesses multi-dimensional quality.
Contribution
The paper presents MATE-3D, a detailed benchmark with diverse prompt categories, and introduces HyperScore, a novel multi-dimensional quality evaluator utilizing hypernetworks for improved assessment.
Findings
HyperScore outperforms existing metrics on MATE-3D.
MATE-3D includes 1,280 textured meshes across 8 prompt categories.
Large-scale subjective evaluation collected over 107,520 annotations.
Abstract
Text-to-3D generation has achieved remarkable progress in recent years, yet evaluating these methods remains challenging for two reasons: i) Existing benchmarks lack fine-grained evaluation on different prompt categories and evaluation dimensions. ii) Previous evaluation metrics only focus on a single aspect (e.g., text-3D alignment) and fail to perform multi-dimensional quality assessment. To address these problems, we first propose a comprehensive benchmark named MATE-3D. The benchmark contains eight well-designed prompt categories that cover single and multiple object generation, resulting in 1,280 generated textured meshes. We have conducted a large-scale subjective experiment from four different evaluation dimensions and collected 107,520 annotations, followed by detailed analyses of the results. Based on MATE-3D, we propose a novel quality evaluator named HyperScore. Utilizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Semantic Web and Ontologies · Natural Language Processing Techniques
MethodsFocus · HyperNetwork
