Improved Multimodal Fusion for Small Datasets with Auxiliary Supervision
Gregory Holste, Douwe van der Wal, Hans Pinckaers, Rikiya Yamashita,, Akinori Mitani, Andre Esteva

TL;DR
This paper introduces three simple auxiliary supervision techniques to improve multimodal fusion in small datasets, demonstrated on prostate cancer diagnosis with image and clinical data, reducing overfitting and enhancing model performance.
Contribution
Proposes three auxiliary supervision methods for better multimodal fusion in small datasets, avoiding complex fusion operations and improving training stability.
Findings
Enhanced fusion methods outperform traditional concatenation.
Auxiliary supervision improves model generalization.
Applicable to various paired image and non-image classification tasks.
Abstract
Prostate cancer is one of the leading causes of cancer-related death in men worldwide. Like many cancers, diagnosis involves expert integration of heterogeneous patient information such as imaging, clinical risk factors, and more. For this reason, there have been many recent efforts toward deep multimodal fusion of image and non-image data for clinical decision tasks. Many of these studies propose methods to fuse learned features from each patient modality, providing significant downstream improvements with techniques like cross-modal attention gating, Kronecker product fusion, orthogonality regularization, and more. While these enhanced fusion operations can improve upon feature concatenation, they often come with an extremely high learning capacity, meaning they are likely to overfit when applied even to small or low-dimensional datasets. Rather than designing a highly expressive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis
