Learning and Evaluating Human Preferences for Conversational Head Generation
Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei

TL;DR
This paper introduces Preference Score, a learning-based metric that accurately predicts human preferences in conversational head video synthesis, enabling efficient and reliable evaluation without manual annotation.
Contribution
The paper presents a novel Preference Score metric that aligns with human preferences and can be used for automatic evaluation of conversational head generation.
Findings
Preference Score correlates well with human judgments.
The metric is robust across different datasets.
It reduces the need for labor-intensive user studies.
Abstract
A reliable and comprehensive evaluation metric that aligns with manual preference assessments is crucial for conversational head video synthesis methods development. Existing quantitative evaluations often fail to capture the full complexity of human preference, as they only consider limited evaluation dimensions. Qualitative evaluations and user studies offer a solution but are time-consuming and labor-intensive. This limitation hinders the advancement of conversational head generation algorithms and systems. In this paper, we propose a novel learning-based evaluation metric named Preference Score (PS) for fitting human preference according to the quantitative evaluations across different dimensions. PS can serve as a quantitative evaluation without the need for human annotation. Experimental results validate the superiority of Preference Score in aligning with human perception, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · AI in Service Interactions · Social Robot Interaction and HRI
Methodsfail
