Asymmetric Reinforcing against Multi-modal Representation Bias
Xiyuan Gao, Bing Cao, Pengfei Zhu, Nannan Wang, Qinghua Hu

TL;DR
This paper introduces ARM, a novel asymmetric reinforcement method that dynamically balances modality contributions in multimodal learning, addressing representation bias and improving overall performance.
Contribution
The paper proposes ARM, a method that adaptively reinforces weak modalities while preserving dominant ones, effectively mitigating multimodal bias and enhancing learning outcomes.
Findings
Significant performance improvements in multimodal tasks.
Effective reduction of modality contribution imbalance.
Enhanced utilization of full multimodal information.
Abstract
The strength of multimodal learning lies in its ability to integrate information from various sources, providing rich and comprehensive insights. However, in real-world scenarios, multi-modal systems often face the challenge of dynamic modality contributions, the dominance of different modalities may change with the environments, leading to suboptimal performance in multimodal learning. Current methods mainly enhance weak modalities to balance multimodal representation bias, which inevitably optimizes from a partialmodality perspective, easily leading to performance descending for dominant modalities. To address this problem, we propose an Asymmetric Reinforcing method against Multimodal representation bias (ARM). Our ARM dynamically reinforces the weak modalities while maintaining the ability to represent dominant modalities through conditional mutual information. Moreover, we provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsBIM and Construction Integration · Manufacturing Process and Optimization
