Uncertainty-aware Multi-modal Learning via Cross-modal Random Network Prediction
Hu Wang, Jianpeng Zhang, Yuanhong Chen, Congbo Ma, Jodie Avery, Louise, Hull, Gustavo Carneiro

TL;DR
This paper introduces CRNP, a novel uncertainty estimation method for multi-modal learning that improves prediction accuracy by effectively combining modalities across various tasks.
Contribution
It presents the first use of random network prediction for uncertainty estimation and multi-modal data fusion, with a stable and adaptable training process.
Findings
Effective on 3D medical image segmentation tasks
Improves robustness in multi-modal classification
Demonstrates adaptability across different tasks
Abstract
Multi-modal learning focuses on training models by equally combining multiple input data modalities during the prediction process. However, this equal combination can be detrimental to the prediction accuracy because different modalities are usually accompanied by varying levels of uncertainty. Using such uncertainty to combine modalities has been studied by a couple of approaches, but with limited success because these approaches are either designed to deal with specific classification or segmentation problems and cannot be easily translated into other tasks, or suffer from numerical instabilities. In this paper, we propose a new Uncertainty-aware Multi-modal Learner that estimates uncertainty by measuring feature density via Cross-modal Random Network Prediction (CRNP). CRNP is designed to require little adaptation to translate between different prediction tasks, while having a stable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification
