Asymmetric Reinforcing against Multi-modal Representation Bias

Xiyuan Gao; Bing Cao; Pengfei Zhu; Nannan Wang; Qinghua Hu

arXiv:2501.01240·cs.CV·January 3, 2025

Asymmetric Reinforcing against Multi-modal Representation Bias

Xiyuan Gao, Bing Cao, Pengfei Zhu, Nannan Wang, Qinghua Hu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces ARM, a novel asymmetric reinforcement method that dynamically balances modality contributions in multimodal learning, addressing representation bias and improving overall performance.

Contribution

The paper proposes ARM, a method that adaptively reinforces weak modalities while preserving dominant ones, effectively mitigating multimodal bias and enhancing learning outcomes.

Findings

01

Significant performance improvements in multimodal tasks.

02

Effective reduction of modality contribution imbalance.

03

Enhanced utilization of full multimodal information.

Abstract

The strength of multimodal learning lies in its ability to integrate information from various sources, providing rich and comprehensive insights. However, in real-world scenarios, multi-modal systems often face the challenge of dynamic modality contributions, the dominance of different modalities may change with the environments, leading to suboptimal performance in multimodal learning. Current methods mainly enhance weak modalities to balance multimodal representation bias, which inevitably optimizes from a partialmodality perspective, easily leading to performance descending for dominant modalities. To address this problem, we propose an Asymmetric Reinforcing method against Multimodal representation bias (ARM). Our ARM dynamically reinforces the weak modalities while maintaining the ability to represent dominant modalities through conditional mutual information. Moreover, we provide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gao-xiyuan/arm
pytorchOfficial

Videos

Asymmetric Reinforcing against Multi-modal Representation Bias· underline

Taxonomy

TopicsBIM and Construction Integration · Manufacturing Process and Optimization