DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation
Rakshith Subramanyam, Kowshik Thopalli, Vivek Narayanaswamy, Jayaraman, J.Thiagarajan

TL;DR
DECIDER is a novel method that leverages large language models and vision-language models to detect and explain failures in image classification models, improving safety and reliability.
Contribution
DECIDER introduces a debiasing approach using LLMs and VLMs to identify model failures and provide human-interpretable explanations, outperforming existing methods.
Findings
Achieves state-of-the-art failure detection across diverse benchmarks.
Significantly outperforms baselines in Matthews correlation coefficient.
Provides human-interpretable failure explanations.
Abstract
Reliably detecting when a deployed machine learning model is likely to fail on a given input is crucial for ensuring safe operation. In this work, we propose DECIDER (Debiasing Classifiers to Identify Errors Reliably), a novel approach that leverages priors from large language models (LLMs) and vision-language models (VLMs) to detect failures in image classification models. DECIDER utilizes LLMs to specify task-relevant core attributes and constructs a ``debiased'' version of the classifier by aligning its visual features to these core attributes using a VLM, and detects potential failure by measuring disagreement between the original and debiased models. In addition to proactively identifying samples on which the model would fail, DECIDER also provides human-interpretable explanations for failure through a novel attribute-ablation strategy. Through extensive experiments across diverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software Testing and Debugging Techniques · Fault Detection and Control Systems
