A Depression Detection Method Based on Multi-Modal Feature Fusion Using Cross-Attention
Shengjie Li, Yinhao Xiao

TL;DR
This paper presents a novel multi-modal feature fusion method using cross-attention and Transformer models to improve depression detection accuracy from social media data, achieving 94.95% accuracy.
Contribution
It introduces a cross-attention based multi-modal fusion approach with a specialized network, significantly enhancing depression detection performance over previous methods.
Findings
Achieved 94.95% accuracy on test data
Outperformed existing depression detection approaches
Demonstrated effectiveness of cross-attention in multi-modal fusion
Abstract
Depression, a prevalent and serious mental health issue, affects approximately 3.8\% of the global population. Despite the existence of effective treatments, over 75\% of individuals in low- and middle-income countries remain untreated, partly due to the challenge in accurately diagnosing depression in its early stages. This paper introduces a novel method for detecting depression based on multi-modal feature fusion utilizing cross-attention. By employing MacBERT as a pre-training model to extract lexical features from text and incorporating an additional Transformer module to refine task-specific contextual understanding, the model's adaptability to the targeted task is enhanced. Diverging from previous practices of simply concatenating multimodal features, this approach leverages cross-attention for feature integration, significantly improving the accuracy in depression detection and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraditional Chinese Medicine Studies · Brain Tumor Detection and Classification · Advanced Computing and Algorithms
MethodsAttention Is All You Need · Residual Connection · Adam · Dropout · Byte Pair Encoding · Layer Normalization · MacBERT · Label Smoothing · Linear Layer · Softmax
