A Discriminative Acoustic-Prosodic Approach for Measuring Local Entrainment
Megan M. Willi, Stephanie A. Borrie, Tyson S. Barrett, Ming Tu, and, Visar Berisha

TL;DR
This paper introduces a discriminative acoustic-prosodic method to measure local entrainment in conversations, which predicts conversational success with 72% accuracy, outperforming previous approaches.
Contribution
It proposes a novel turn-by-turn local entrainment measurement method based on discriminative features that distinguish real from sham conversations.
Findings
Achieved 72% classification accuracy in predicting conversational success.
Outperformed three previous methods on the same dataset.
Demonstrated the effectiveness of turn-by-turn acoustic-prosodic features.
Abstract
Acoustic-prosodic entrainment describes the tendency of humans to align or adapt their speech acoustics to each other in conversation. This alignment of spoken behavior has important implications for conversational success. However, modeling the subtle nature of entrainment in spoken dialogue continues to pose a challenge. In this paper, we propose a straightforward definition for local entrainment in the speech domain and operationalize an algorithm based on this: acoustic-prosodic features that capture entrainment should be maximally different between real conversations involving two partners and sham conversations generated by randomly mixing the speaking turns from the original two conversational partners. We propose an approach for measuring local entrainment that quantifies alignment of behavior on a turn-by-turn basis, projecting the differences between interlocutors'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
