Conversation Group Detection With Spatio-Temporal Context
Stephanie Tan, David M.J. Tax, Hayley Hung

TL;DR
This paper introduces a deep learning approach using dynamic LSTM models and graph clustering to detect conversation groups in social scenarios from overhead video recordings, emphasizing temporal and spatial context.
Contribution
It presents a novel method leveraging temporal dynamics and continuous affinity predictions for improved conversation group detection in social settings.
Findings
Improved group detection accuracy with temporal granularity.
Affinity values correlate with conversation group membership.
Predicted affinities can forecast future group memberships.
Abstract
In this work, we propose an approach for detecting conversation groups in social scenarios like cocktail parties and networking events, from overhead camera recordings. We posit the detection of conversation groups as a learning problem that could benefit from leveraging the spatial context of the surroundings, and the inherent temporal context in interpersonal dynamics which is reflected in the temporal dynamics in human behavior signals, an aspect that has not been addressed in recent prior works. This motivates our approach which consists of a dynamic LSTM-based deep learning model that predicts continuous pairwise affinity values indicating how likely two people are in the same conversation group. These affinity values are also continuous in time, since relationships and group membership do not occur instantaneously, even though the ground truths of group membership are binary.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Opinion Dynamics and Social Influence · Anomaly Detection Techniques and Applications
