ADMC: Attention-based Diffusion Model for Missing Modalities Feature Completion
Wei Zhang, Juan Chen, Yanbo J. Wang, En Zhu, Xuan Yang, Yiduo Wang

TL;DR
This paper introduces ADMC, an attention-based diffusion model that effectively completes missing modality features in multimodal emotion and intent recognition, improving accuracy in both missing and complete data scenarios.
Contribution
The paper presents a novel attention-based diffusion framework that independently trains modality-specific feature extractors and generates missing features, avoiding over-coupling and improving multimodal recognition.
Findings
Achieves state-of-the-art results on IEMOCAP and MIntRec benchmarks.
Effectively handles various missing modality scenarios.
Enhances recognition accuracy even with complete modalities.
Abstract
Multimodal emotion and intent recognition is essential for automated human-computer interaction, It aims to analyze users' speech, text, and visual information to predict their emotions or intent. One of the significant challenges is that missing modalities due to sensor malfunctions or incomplete data. Traditional methods that attempt to reconstruct missing information often suffer from over-coupling and imprecise generation processes, leading to suboptimal outcomes. To address these issues, we introduce an Attention-based Diffusion model for Missing Modalities feature Completion (ADMC). Our framework independently trains feature extraction networks for each modality, preserving their unique characteristics and avoiding over-coupling. The Attention-based Diffusion Network (ADN) generates missing modality features that closely align with authentic multimodal distribution, enhancing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Multimodal Machine Learning Applications
MethodsALIGN · Diffusion
