TL;DR
This paper systematically reviews evaluation practices for co-speech gesture generation in embodied conversational agents, highlighting the need for standardized, rigorous assessment tools and proposing a checklist for consistent reporting.
Contribution
It provides a comprehensive review of existing evaluation methods, identifies gaps, and offers recommendations and a checklist to improve consistency and rigor in gesture evaluation studies.
Findings
Most studies use subjective, within-subject evaluation methods
Lack of standardized evaluation tools across studies
Recommendations for systematic testing and reporting of gesture generation models
Abstract
Embodied conversational agents (ECA) are often designed to produce nonverbal behavior to complement or enhance their verbal communication. One such form of nonverbal behavior is co-speech gesturing, which involves movements that the agent makes with its arms and hands that are paired with verbal communication. Co-speech gestures for ECAs can be created using different generation methods, divided into rule-based and data-driven processes, with the latter gaining traction because of the increasing interest from the applied machine learning community. However, reports on gesture generation methods use a variety of evaluation measures, which hinders comparison. To address this, we present a systematic review on co-speech gesture generation methods for iconic, metaphoric, deictic, and beat gestures, including reported evaluation methods. We review 22 studies that have an ECA with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
