Modeling social interaction dynamics using temporal graph networks
J. Taery Kim, Archit Naik, Isuru Jayarathne, Sehoon Ha, Jouh Yeong, Chew

TL;DR
This paper introduces an adapted Temporal Graph Network model that effectively captures social interaction dynamics using multi-modal data, significantly improving prediction accuracy while reducing computational complexity for human-robot interaction tasks.
Contribution
The paper presents a novel, efficient temporal graph network approach for modeling social interactions, incorporating multi-modal data and outperforming baseline models in prediction tasks.
Findings
F1-score improved by 37.0% over baseline
Next speaker prediction improved by 29.0%
Reduced message passing from 768 to 14 elements
Abstract
Integrating intelligent systems, such as robots, into dynamic group settings poses challenges due to the mutual influence of human behaviors and internal states. A robust representation of social interaction dynamics is essential for effective human-robot collaboration. Existing approaches often narrow their focus to facial expressions or speech, overlooking the broader context. We propose employing an adapted Temporal Graph Networks to comprehensively represent social interaction dynamics while enabling its practical implementation. Our method incorporates temporal multi-modal behavioral data including gaze interaction, voice activity and environmental context. This representation of social interaction dynamics is trained as a link prediction problem using annotated gaze interaction data. The F1-score outperformed the baseline model by 37.0%. This improvement is consistent for a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Emotion and Mood Recognition
