Context-specific Credibility-aware Multimodal Fusion with Conditional Probabilistic Circuits
Pranuthi Tenali, Sahil Sidheekh, Saurabh Mathur, Erik Blasch, Kristian Kersting, Sriraam Natarajan

TL;DR
This paper presents C²MF, a novel multimodal fusion framework that adaptively models source reliability at the instance level using Conditional Probabilistic Circuits, improving robustness in conflicting and noisy scenarios.
Contribution
It introduces CSIC, a KL-divergence-based measure for context-specific reliability, and a new Conflict benchmark for evaluating robustness against class-specific corruptions.
Findings
C²MF improves accuracy by up to 29% over static methods in high-noise scenarios.
CSIC generalizes static credibility estimates, enabling adaptive reliability assessment.
Experimental results demonstrate enhanced robustness and interpretability in multimodal fusion.
Abstract
Multimodal fusion requires integrating information from multiple sources that may conflict depending on context. Existing fusion approaches typically rely on static assumptions about source reliability, limiting their ability to resolve conflicts when a modality becomes unreliable due to situational factors such as sensor degradation or class-specific corruption. We introduce CMF, a context-specfic credibility-aware multimodal fusion framework that models per-instance source reliability using a Conditional Probabilistic Circuit (CPC). We formalize instance-level reliability through Context-Specific Information Credibility (CSIC), a KL-divergence-based measure computed exactly from the CPC. CSIC generalizes conventional static credibility estimates as a special case, enabling principled and adaptive reliability assessment. To evaluate robustness under cross-modal conflicts, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
