RedCore: Relative Advantage Aware Cross-modal Representation Learning for Missing Modalities with Imbalanced Missing Rates
Jun Sun, Xinxin Zhang, Shoukang Han, Yu-ping Ruan, Taihao Li

TL;DR
RedCore introduces a novel cross-modal learning method that adaptively handles missing data and imbalanced missing rates across modalities, improving robustness in multimodal tasks.
Contribution
The paper proposes RedCore, a new approach using variational information bottleneck and relative advantage to effectively learn from incomplete and imbalanced multimodal data.
Findings
RedCore outperforms existing models under large missing rates.
RedCore demonstrates robustness to imbalanced missing modalities.
Empirical results validate the effectiveness of the bi-level optimization.
Abstract
Multimodal learning is susceptible to modality missing, which poses a major obstacle for its practical applications and, thus, invigorates increasing research interest. In this paper, we investigate two challenging problems: 1) when modality missing exists in the training data, how to exploit the incomplete samples while guaranteeing that they are properly supervised? 2) when the missing rates of different modalities vary, causing or exacerbating the imbalance among modalities, how to address the imbalance and ensure all modalities are well-trained? To tackle these two challenges, we first introduce the variational information bottleneck (VIB) method for the cross-modal representation learning of missing modalities, which capitalizes on the available modalities and the labels as supervision. Then, accounting for the imbalanced missing rates, we define relative advantage to quantify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Subtitles and Audiovisual Media · Cancer-related molecular mechanisms research
MethodsAttentive Walk-Aggregating Graph Neural Network
