Multimodal Negative Learning
Baoquan Gong, Xiyuan Gao, Pengfei Zhu, Qinghua Hu, Bing Cao

TL;DR
This paper introduces a novel multimodal learning paradigm called 'Learning Not to be' that uses negative guidance to improve robustness and preserve modality-specific information, especially in noisy or imbalanced settings.
Contribution
It proposes the Multimodal Negative Learning framework, a new approach that enhances robustness by dynamically guiding weak modalities to suppress non-target classes, avoiding over-alignment.
Findings
Tightens the robustness lower bound of multimodal learning.
Reduces empirical error of weak modalities under noise and imbalance.
Demonstrates effectiveness across multiple benchmarks.
Abstract
Multimodal learning systems often encounter challenges related to modality imbalance, where a dominant modality may overshadow others, thereby hindering the learning of weak modalities. Conventional approaches often force weak modalities to align with dominant ones in "Learning to be (the same)" (Positive Learning), which risks suppressing the unique information inherent in the weak modalities. To address this challenge, we offer a new learning paradigm: "Learning Not to be" (Negative Learning). Instead of enhancing weak modalities' target-class predictions, the dominant modalities dynamically guide the weak modality to suppress non-target classes. This stabilizes the decision space and preserves modality-specific information, allowing weak modalities to preserve unique information without being over-aligned. We proceed to reveal multimodal learning from a robustness perspective and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Imbalanced Data Classification Techniques
