AudioInsight: Detecting Social Contexts Relevant to Social Anxiety from Speech
Varun Reddy, Zhiyuan Wang, Emma Toner, Max Larrazabal, Mehdi, Boukhechba, Bethany A. Teachman, Laura E. Barnes

TL;DR
This study develops a deep learning approach to detect social interaction types and evaluative threat levels from ambient audio in virtual social settings, aiding mental health support for socially anxious individuals.
Contribution
It introduces a novel ambient audio-based method to classify social contexts and evaluative threat, demonstrating high accuracy in virtual interactions with potential for real-world application.
Findings
Dyadic vs. group interaction detection at 90% accuracy
Evaluative threat detection at 83% accuracy
Method generalizes beyond virtual settings
Abstract
During social interactions, understanding the intricacies of the context can be vital, particularly for socially anxious individuals. While previous research has found that the presence of a social interaction can be detected from ambient audio, the nuances within social contexts, which influence how anxiety provoking interactions are, remain largely unexplored. As an alternative to traditional, burdensome methods like self-report, this study presents a novel approach that harnesses ambient audio segments to detect social threat contexts. We focus on two key dimensions: number of interaction partners (dyadic vs. group) and degree of evaluative threat (explicitly evaluative vs. not explicitly evaluative). Building on data from a Zoom-based social interaction study (N=52 college students, of whom the majority N=45 are socially anxious), we employ deep learning methods to achieve strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStuttering Research and Treatment
MethodsFocus
