Understanding the Predictability of Gesture Parameters from Speech and their Perceptual Importance
Ylva Ferstl, Michael Neff, Rachel McDonnell

TL;DR
This study investigates how speech relates to gesture parameters and their perceptual importance, revealing partial predictability of gestures from speech and varying impacts of gesture features on perception.
Contribution
It introduces a detailed analysis of gesture parameters' predictability from speech and their perceptual significance, advancing understanding beyond black-box gesture generators.
Findings
Gesture parameters can be partially predicted from speech.
Some parameters, like path length, are predicted more accurately.
Degradation of any gesture parameter negatively affects perceived appropriateness.
Abstract
Gesture behavior is a natural part of human conversation. Much work has focused on removing the need for tedious hand-animation to create embodied conversational agents by designing speech-driven gesture generators. However, these generators often work in a black-box manner, assuming a general relationship between input speech and output motion. As their success remains limited, we investigate in more detail how speech may relate to different aspects of gesture motion. We determine a number of parameters characterizing gesture, such as speed and gesture size, and explore their relationship to the speech signal in a two-fold manner. First, we train multiple recurrent networks to predict the gesture parameters from speech to understand how well gesture attributes can be modeled from speech alone. We find that gesture parameters can be partially predicted from speech, and some parameters,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
