Multi-modal Learning with Missing Modality via Shared-Specific Feature Modelling
Hu Wang, Yuanhong Chen, Congbo Ma, Jodie Avery, Louise Hull, Gustavo, Carneiro

TL;DR
The paper introduces ShaSpec, a novel multi-modal learning method that effectively handles missing modalities by learning shared and specific features, improving performance across various tasks like classification and segmentation.
Contribution
ShaSpec is a simple, effective approach that leverages auxiliary tasks and residual fusion to handle missing modalities and adapt to multiple tasks.
Findings
Outperforms existing methods significantly in medical image segmentation and classification.
Achieves over 3% improvement on BraTS2018 for tumor segmentation.
Easily adaptable to different multi-modal tasks.
Abstract
The missing modality issue is critical but non-trivial to be solved by multi-modal models. Current methods aiming to handle the missing modality problem in multi-modal tasks, either deal with missing modalities only during evaluation or train separate models to handle specific missing modality settings. In addition, these models are designed for specific tasks, so for example, classification models are not easily adapted to segmentation tasks and vice versa. In this paper, we propose the Shared-Specific Feature Modelling (ShaSpec) method that is considerably simpler and more effective than competing approaches that address the issues above. ShaSpec is designed to take advantage of all available input modalities during training and evaluation by learning shared and specific features to better represent the input data. This is achieved from a strategy that relies on auxiliary tasks based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
