Adaptive Confidence Regularization for Multimodal Failure Detection
Moru Liu, Hao Dong, Olga Fink, Mario Trapp

TL;DR
This paper introduces Adaptive Confidence Regularization (ACR), a novel framework for failure detection in multimodal models, which improves reliability by penalizing confidence degradation and synthesizing failure-aware training examples.
Contribution
The paper proposes ACR, a new method combining confidence regularization and feature swapping to enhance failure detection in multimodal systems.
Findings
ACR achieves consistent performance improvements across multiple datasets.
The method effectively detects and rejects uncertain multimodal predictions.
Synthetic failure examples improve the model's failure recognition capabilities.
Abstract
The deployment of multimodal models in high-stakes domains, such as self-driving vehicles and medical diagnostics, demands not only strong predictive performance but also reliable mechanisms for detecting failures. In this work, we address the largely unexplored problem of failure detection in multimodal contexts. We propose Adaptive Confidence Regularization (ACR), a novel framework specifically designed to detect multimodal failures. Our approach is driven by a key observation: in most failure cases, the confidence of the multimodal prediction is significantly lower than that of at least one unimodal branch, a phenomenon we term confidence degradation. To mitigate this, we introduce an Adaptive Confidence Loss that penalizes such degradations during training. In addition, we propose Multimodal Feature Swapping, a novel outlier synthesis technique that generates challenging,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
