BROTHER: Behavioral Recognition Optimized Through Heterogeneous Ensemble Regularization for Ambivalence and Hesitancy
Alexandre Pereira, Bruno Fernandes, Pablo Barros

TL;DR
This paper introduces BROTHER, a multimodal ensemble approach with regularization and optimization techniques to accurately recognize complex behavioral states like Ambivalence and Hesitancy in naturalistic videos.
Contribution
It proposes a novel regularized multimodal fusion pipeline and a PSO-based ensemble method to improve behavioral recognition accuracy in challenging real-world settings.
Findings
Linguistic features are the strongest predictor of A/H.
The PSO ensemble achieves a Macro F1-score of 0.7465.
Regularization effectively suppresses overfitting in multimodal fusion.
Abstract
Recognizing complex behavioral states such as Ambivalence and Hesitancy (A/H) in naturalistic video settings remains a significant challenge in affective computing. Unlike basic facial expressions, A/H manifests as subtle, multimodal conflicts that require deep contextual and temporal understanding. In this paper, we propose a highly regularized, multimodal fusion pipeline to predict A/H at the video level. We extract robust unimodal features from visual, acoustic, and linguistic data, introducing a specialized statistical text modality explicitly designed to capture temporal speech variations and behavioral cues. To identify the most effective representations, we evaluate 15 distinct modality combinations across a committee of machine learning classifiers (MLP, Random Forest, and GBDT), selecting the most well-calibrated models based on validation Binary Cross-Entropy (BCE) loss.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Speech and Audio Processing · Face recognition and analysis
