Learning Optimal Multimodal Information Bottleneck Representations

Qilong Wu; Yiyang Shao; Jun Wang; Xiaobo Sun

arXiv:2505.19996·cs.LG·May 27, 2025

Learning Optimal Multimodal Information Bottleneck Representations

Qilong Wu, Yiyang Shao, Jun Wang, Xiaobo Sun

PDF

Open Access

TL;DR

This paper introduces OMIB, a novel multimodal learning framework that guarantees optimal information bottleneck representations by theoretically bounding regularization weights and dynamically balancing modality contributions, leading to improved task performance.

Contribution

The paper proposes OMIB, a new method that ensures optimal multimodal information bottleneck representations with a theoretically grounded regularization scheme and dynamic modality balancing.

Findings

01

OMIB guarantees the achievability of optimal MIB.

02

OMIB outperforms state-of-the-art benchmarks in downstream tasks.

03

Theoretical validation on synthetic data supports OMIB's effectiveness.

Abstract

Leveraging high-quality joint representations from multimodal data can greatly enhance model performance in various machine-learning based applications. Recent multimodal learning methods, based on the multimodal information bottleneck (MIB) principle, aim to generate optimal MIB with maximal task-relevant information and minimal superfluous information via regularization. However, these methods often set ad hoc regularization weights and overlook imbalanced task-relevant information across modalities, limiting their ability to achieve optimal MIB. To address this gap, we propose a novel multimodal learning framework, Optimal Multimodal Information Bottleneck (OMIB), whose optimization objective guarantees the achievability of optimal MIB by setting the regularization weight within a theoretically derived bound. OMIB further addresses imbalanced task-relevant information by dynamically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Spam and Phishing Detection

MethodsHigh-Order Consensuses · Sparse Evolutionary Training