Multi-modal Learning with Missing Modality via Shared-Specific Feature   Modelling

Hu Wang; Yuanhong Chen; Congbo Ma; Jodie Avery; Louise Hull; Gustavo; Carneiro

arXiv:2307.14126·cs.CV·June 14, 2024·5 cites

Multi-modal Learning with Missing Modality via Shared-Specific Feature Modelling

Hu Wang, Yuanhong Chen, Congbo Ma, Jodie Avery, Louise Hull, Gustavo, Carneiro

PDF

Open Access 1 Repo

TL;DR

The paper introduces ShaSpec, a novel multi-modal learning method that effectively handles missing modalities by learning shared and specific features, improving performance across various tasks like classification and segmentation.

Contribution

ShaSpec is a simple, effective approach that leverages auxiliary tasks and residual fusion to handle missing modalities and adapt to multiple tasks.

Findings

01

Outperforms existing methods significantly in medical image segmentation and classification.

02

Achieves over 3% improvement on BraTS2018 for tumor segmentation.

03

Easily adaptable to different multi-modal tasks.

Abstract

The missing modality issue is critical but non-trivial to be solved by multi-modal models. Current methods aiming to handle the missing modality problem in multi-modal tasks, either deal with missing modalities only during evaluation or train separate models to handle specific missing modality settings. In addition, these models are designed for specific tasks, so for example, classification models are not easily adapted to segmentation tasks and vice versa. In this paper, we propose the Shared-Specific Feature Modelling (ShaSpec) method that is considerably simpler and more effective than competing approaches that address the issues above. ShaSpec is designed to take advantage of all available input modalities during training and evaluation by learning shared and specific features to better represent the input data. This is achieved from a strategy that relies on auxiliary tasks based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

billhhh/ShaSpec
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques