Unsupervised Model Diagnosis
Yinong Oliver Wang, Eileen Li, Jinqi Luo, Zhaoning Wang, Fernando De, la Torre

TL;DR
This paper introduces UMO, an unsupervised framework that uses generative models to identify and visualize model vulnerabilities and failure modes in vision systems without human guidance.
Contribution
UMO is the first unsupervised approach to diagnose model robustness by discovering semantic counterfactuals without user input, leveraging generative models and text-based attribute matching.
Findings
Successfully identifies spurious correlations in vision models.
Visualizes failure modes without human intervention.
Validates on multiple vision tasks like classification and segmentation.
Abstract
Ensuring model explainability and robustness is essential for reliable deployment of deep vision systems. Current methods for evaluating robustness rely on collecting and annotating extensive test sets. While this is common practice, the process is labor-intensive and expensive with no guarantee of sufficient coverage across attributes of interest. Recently, model diagnosis frameworks have emerged leveraging user inputs (e.g., text) to assess the vulnerability of the model. However, such dependence on human can introduce bias and limitation given the domain knowledge of particular users. This paper proposes Unsupervised Model Diagnosis (UMO), that leverages generative models to produce semantic counterfactual explanations without any user guidance. Given a differentiable computer vision model (i.e., the target model), UMO optimizes for the most counterfactual directions in a generative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Adversarial Robustness in Machine Learning
