Deep Multimodal Learning with Missing Modality: A Survey

Renjie Wu; Hu Wang; Hsiang-Ting Chen; Gustavo Carneiro

arXiv:2409.07825·cs.CV·February 5, 2026·3 cites

Deep Multimodal Learning with Missing Modality: A Survey

Renjie Wu, Hu Wang, Hsiang-Ting Chen, Gustavo Carneiro

PDF

Open Access

TL;DR

This survey reviews recent deep learning approaches for multimodal learning when some data modalities are missing, highlighting methods, applications, datasets, challenges, and future directions to improve model robustness.

Contribution

It provides the first comprehensive overview of deep multimodal learning with missing modalities, clarifying its motivation, distinctions, and current research landscape.

Findings

01

Summarizes recent deep learning methods for MLMM

02

Analyzes applications and datasets in MLMM

03

Discusses challenges and future research directions

Abstract

During multimodal model training and testing, certain data modalities may be absent due to sensor limitations, cost constraints, privacy concerns, or data loss, negatively affecting performance. Multimodal learning techniques designed to handle missing modalities can mitigate this by ensuring model robustness even when some modalities are unavailable. This survey reviews recent progress in Multimodal Learning with Missing Modality (MLMM), focusing on deep learning methods. It provides the first comprehensive survey that covers the motivation and distinctions between MLMM and standard multimodal learning setups, followed by a detailed analysis of current methods, applications, and datasets, concluding with challenges and future directions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies