Multimodal Ambivalence/Hesitancy Recognition in Videos for Personalized Digital Health Interventions
Manuela Gonz\'alez-Gonz\'alez, Soufiane Belharbi, Muhammad Osama Zeeshan, Masoumeh Sharafi, Muhammad Haseeb Aslam, Lorenzo Sia, Nicolas Richet, Marco Pedersoli, Alessandro Lameiras Koerich, Simon L Bacon, Eric Granger

TL;DR
This paper investigates deep learning approaches for recognizing ambivalence and hesitancy in videos to enhance personalized digital health interventions, highlighting current limitations and the need for improved multimodal models.
Contribution
It explores supervised, unsupervised, and zero-shot learning setups for A/H recognition in videos, using the new BAH dataset, and identifies challenges in current methods.
Findings
Limited performance of current models on A/H recognition.
Need for better multimodal fusion techniques.
Existing models struggle with subtle affective conflicts.
Abstract
Using behavioural science, health interventions focus on behaviour change by providing a framework to help patients acquire and maintain healthy habits that improve medical outcomes. In-person interventions are costly and difficult to scale, especially in resource-limited regions. Digital health interventions offer a cost-effective approach, potentially supporting independent living and self-management. Automating such interventions, especially through machine learning, has gained considerable attention recently. Ambivalence and hesitancy (A/H) play a primary role for individuals to delay, avoid, or abandon health interventions. A/H are subtle and conflicting emotions that place a person in a state between positive and negative evaluations of a behaviour, or between acceptance and refusal to engage in it. They manifest as affective inconsistency across modalities or within a modality,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
