Few-Shot Multimodal Medical Imaging: A Theoretical Framework
Md Talha Mohsin, Ismail Abdulrashid

TL;DR
This paper develops a theoretical framework for few-shot multimodal medical imaging, explaining sample complexity, uncertainty, and interpretability, and validating it with controlled experiments.
Contribution
It introduces a unified theoretical approach using PAC and VC theory to analyze and guide multimodal medical imaging in low data regimes.
Findings
Complementary modalities reduce sample complexity.
Explanation variance decreases inversely with sample size.
Multimodal models show improved performance in few-shot settings.
Abstract
Medical imaging often operates under limited labeled data, especially in rare disease and low resource clinical environments. Existing multimodal and meta learning approaches improve performance in these settings but lack a theoretical explanation of why or when they succeed. This paper presents a unified theoretical framework for few shot multimodal medical imaging that jointly characterizes sample complexity, uncertainty quantification, and interpretability. Using PAC learning, VC theory, and PAC Bayesian analysis, we derive bounds that describe the minimum number of labeled samples required for reliable performance and show how complementary modalities reduce effective capacity through an information gain term. We further introduce a formal metric for explanation stability, proving that explanation variance decreases at an inverse n rate. A sequential Bayesian interpretation of Chain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
