Concept-based Explanations for Out-Of-Distribution Detectors

Jihye Choi; Jayaram Raghuram; Ryan Feng; Jiefeng Chen; Somesh Jha,; Atul Prakash

arXiv:2203.02586·cs.LG·June 7, 2023·1 cites

Concept-based Explanations for Out-Of-Distribution Detectors

Jihye Choi, Jayaram Raghuram, Ryan Feng, Jiefeng Chen, Somesh Jha,, Atul Prakash

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a method to interpret out-of-distribution detectors using high-level concepts, proposing new metrics and an unsupervised framework to improve explanation quality and understanding of detector decisions.

Contribution

It presents novel metrics for evaluating concept explanations and an unsupervised approach to learn concepts that enhance interpretability of OOD detectors.

Findings

01

Metrics effectively assess explanation quality.

02

Framework improves concept-based explanations.

03

Enhanced understanding of OOD detector decisions.

Abstract

Out-of-distribution (OOD) detection plays a crucial role in ensuring the safe deployment of deep neural network (DNN) classifiers. While a myriad of methods have focused on improving the performance of OOD detectors, a critical gap remains in interpreting their decisions. We help bridge this gap by providing explanations for OOD detectors based on learned high-level concepts. We first propose two new metrics for assessing the effectiveness of a particular set of concepts for explaining OOD detectors: 1) detection completeness, which quantifies the sufficiency of concepts for explaining an OOD-detector's decisions, and 2) concept separability, which captures the distributional separation between in-distribution and OOD data in the concept space. Based on these metrics, we propose an unsupervised framework for learning a set of concepts that satisfy the desired properties of high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jihyechoi77/concepts-for-ood
tfOfficial

Videos

Concept-based Explanations for Out-of-Distribution Detectors· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications