Spatiotemporal Emotional Synchrony in Dyadic Interactions: The Role of Speech Conditions in Facial and Vocal Affective Alignment
By Ralph Dane Marquez Herbuela, Yukie Nagai

TL;DR
This study investigates how speech overlap influences emotional synchronization across facial and vocal channels in dyadic interactions, revealing that non-overlapping speech fosters more stable emotional alignment and highlighting the importance of conversational structure.
Contribution
The paper provides novel insights into the impact of speech overlap on multimodal emotional synchrony, using continuous emotion estimates and dynamic analysis methods on real-world interaction data.
Findings
Non-overlapping speech enhances the stability of emotional synchrony.
Lag-adjusted correlations show clearer temporal alignment in non-overlapping segments.
Facial expressions tend to lead speech during turn-taking, whereas speech leads during simultaneous vocalizations.
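To make the lead/lag findings concrete, here is a minimal sketch of a lag-adjusted Pearson correlation between a facial and a vocal emotion series. It is an illustration, not the authors' implementation: the function name `lagged_pearson`, the `max_lag` parameter, and the assumption that both series are already resampled to a common frame rate are all hypothetical.

```python
import numpy as np

def lagged_pearson(face, voice, max_lag):
    """Search integer lags in [-max_lag, max_lag] for the highest Pearson r.

    Returns (best_lag, best_r). A positive best_lag means `face` leads:
    face[t] aligns best with voice[t + best_lag]. A negative value means
    `voice` leads. Both inputs are assumed to share one sampling rate.
    """
    face = np.asarray(face, dtype=float)
    voice = np.asarray(voice, dtype=float)
    if face.ndim != 1 or face.shape != voice.shape:
        raise ValueError("expected two 1-D series of equal length")

    best_lag, best_r = 0, -np.inf
    for lag in range(-max_lag, max_lag + 1):
        # Trim both series so that only the overlapping samples are compared.
        if lag >= 0:
            a, b = face[:len(face) - lag], voice[lag:]
        else:
            a, b = face[-lag:], voice[:len(voice) + lag]
        # Skip lags with too few samples or a constant segment (r undefined).
        if len(a) < 3 or a.std() == 0 or b.std() == 0:
            continue
        r = np.corrcoef(a, b)[0, 1]
        if r > best_r:
            best_lag, best_r = lag, r
    return best_lag, best_r
```

Applied separately to arousal and valence within each overlapping or non-overlapping segment, the sign of the returned lag indicates which modality leads under that convention.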
Abstract
Understanding how humans express and synchronize emotions across multiple communication channels, particularly facial expressions and speech, has significant implications for emotion recognition systems and human-computer interaction. Motivated by the notion that non-overlapping speech promotes clearer emotional coordination while overlapping speech disrupts synchrony, this study examines how these conversational dynamics shape the spatial and temporal alignment of arousal and valence across facial and vocal modalities. Using dyadic interactions from the IEMOCAP dataset, we extracted continuous emotion estimates via EmoNet (facial video) and a Wav2Vec2-based model (speech audio). Segments were categorized based on speech overlap, and emotional alignment was assessed using Pearson correlation, lag-adjusted analysis, and Dynamic Time Warping (DTW). Across analyses, non-overlapping speech…
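As a rough illustration of the DTW measure named in the abstract, the sketch below computes the classic dynamic-programming DTW distance between two one-dimensional series. This summary does not state the settings the paper used (window constraint, step pattern, normalization), so the unconstrained form with an absolute-difference cost is an assumption, and `dtw_distance` is a hypothetical name.

```python
import numpy as np

def dtw_distance(x, y):
    """Unconstrained DTW distance between two 1-D series.

    Uses absolute difference as the local cost and the standard three-way
    recurrence (insertion, deletion, match). Lower values mean the two
    series can be warped onto each other more closely.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n, m = len(x), len(y)

    # cost[i, j] = minimal cumulative cost of aligning x[:i] with y[:j].
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(x[i - 1] - y[j - 1])
            cost[i, j] = d + min(cost[i - 1, j],      # advance x only
                                 cost[i, j - 1],      # advance y only
                                 cost[i - 1, j - 1])  # advance both
    return cost[n, m]
```

Unlike a fixed-lag correlation, DTW tolerates local stretching and compression in time, so it can register two emotion trajectories as similar even when one modality speeds up or slows down relative to the other.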
Taxonomy
Topics: Multisensory perception and integration · Language, Metaphor, and Cognition
Methods: Dynamic Time Warping · Network On Network
