Balanced Multimodal Learning via Mutual Information
Rongrong Xie, Guido Sanguinetti

TL;DR
This paper introduces a unified framework for balanced multimodal learning that uses mutual information to address modality imbalance, improving performance especially in biological data analysis.
Contribution
The study proposes a novel approach combining cross-modal knowledge distillation and multitask-like training to effectively mitigate modality imbalance in multimodal learning.
Findings
Enhanced model performance on imbalanced multimodal datasets
Effective utilization of mutual information for modality calibration
Improved predictive accuracy in biological data analysis
Abstract
Multimodal learning has increasingly become a focal point in research, primarily due to its ability to integrate complementary information from diverse modalities. Nevertheless, modality imbalance, stemming from factors such as insufficient data acquisition and disparities in data quality, has often been inadequately addressed. This issue is particularly prominent in biological data analysis, where datasets are frequently limited, costly to acquire, and inherently heterogeneous in quality. Conventional multimodal methodologies typically fall short in concurrently harnessing intermodal synergies and effectively resolving modality conflicts. In this study, we propose a novel unified framework explicitly designed to address modality imbalance by utilizing mutual information to quantify interactions between modalities. Our approach adopts a balanced multimodal learning strategy comprising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Domain Adaptation and Few-Shot Learning · Topic Modeling
