Distributionally Robust Multimodal Machine Learning

Peilin Yang; Yu Ma

arXiv:2511.05716·cs.LG·November 11, 2025

Distributionally Robust Multimodal Machine Learning

Peilin Yang, Yu Ma

PDF

Open Access

TL;DR

This paper introduces a distributionally robust optimization framework for multimodal machine learning, providing theoretical guarantees and empirical improvements in robustness for high-stakes applications.

Contribution

It proposes a novel DRO framework that captures modality-aware effects, with theoretical analysis and empirical validation demonstrating enhanced robustness.

Findings

01

Improved robustness in simulation and real-world datasets.

02

Theoretical performance guarantees through bounds.

03

Extension to encoder-specific error propagation settings.

Abstract

We consider the problem of distributionally robust multimodal machine learning. Existing approaches often rely on merging modalities on the feature level (early fusion) or heuristic uncertainty modeling, which downplays modality-aware effects and provide limited insights. We propose a novel distributionally robust optimization (DRO) framework that aims to study both the theoretical and practical insights of multimodal machine learning. We first justify this setup and show the significance of this problem through complexity analysis. We then establish both generalization upper bounds and minimax lower bounds which provide performance guarantees. These results are further extended in settings where we consider encoder-specific error propogations. Empirically, we demonstrate that our approach improves robustness in both simulation settings and real-world datasets. Together, these findings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques