ProtoShotXAI: Using Prototypical Few-Shot Architecture for Explainable AI
Samuel Hess, Gregory Ditzler

TL;DR
ProtoShotXAI introduces a novel explainable AI approach using a prototypical few-shot network to explore a black-box model's feature space, enhancing interpretability, model exploration, and adversarial sample detection.
Contribution
It is the first locally interpretable XAI model that extends to few-shot networks, leveraging contrastive manifolds for improved interpretability and trust.
Findings
Outperforms state-of-the-art XAI methods on MNIST, Omniglot, and ImageNet.
Provides better flexibility for model exploration.
Detects adversarial samples effectively.
Abstract
Unexplainable black-box models create scenarios where anomalies cause deleterious responses, thus creating unacceptable risks. These risks have motivated the field of eXplainable Artificial Intelligence (XAI) to improve trust by evaluating local interpretability in black-box neural networks. Unfortunately, the ground truth is unavailable for the model's decision, so evaluation is limited to qualitative assessment. Further, interpretability may lead to inaccurate conclusions about the model or a false sense of trust. We propose to improve XAI from the vantage point of the user's trust by exploring a black-box model's latent feature space. We present an approach, ProtoShotXAI, that uses a Prototypical few-shot network to explore the contrastive manifold between nonlinear features of different classes. A user explores the manifold by perturbing the input features of a query sample and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
