Rethinking Multi-Modal Learning from Gradient Uncertainty

Peizheng Guo; Jingyao Wang; Wenwen Qiang; Jiahuan Zhou; Changwen Zheng; Gang Hua

arXiv:2505.23071·cs.LG·February 9, 2026

Rethinking Multi-Modal Learning from Gradient Uncertainty

Peizheng Guo, Jingyao Wang, Wenwen Qiang, Jiahuan Zhou, Changwen Zheng, Gang Hua

PDF

Open Access

TL;DR

This paper introduces a Bayesian gradient calibration method for multi-modal learning that models gradient uncertainty to improve optimization, addressing persistent performance fluctuations.

Contribution

It proposes a novel Bayesian-oriented approach that explicitly models gradient uncertainty and adaptively weights gradients, enhancing multi-modal learning optimization.

Findings

01

Improved predictive accuracy across multiple datasets

02

Effective handling of gradient reliability fluctuations

03

Enhanced robustness in multi-modal learning scenarios

Abstract

Multi-Modal Learning (MML) integrates information from diverse modalities to improve predictive accuracy. While existing optimization strategies have made significant strides by mitigating gradient direction conflicts, we revisit MML from a gradient-based perspective to explore further improvements. Empirically, we observe an interesting phenomenon: performance fluctuations can persist in both conflict and non-conflict settings. Based on this, we argue that: beyond gradient direction, the intrinsic reliability of gradients acts as a decisive factor in optimization, necessitating the explicit modeling of gradient uncertainty. Guided by this insight, we propose Bayesian-Oriented Gradient Calibration for MML (BOGC-MML). Our approach explicitly models gradients as probability distributions to capture uncertainty, interpreting their precision as evidence within the framework of subjective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification