Towards Modeling Uncertainties of Self-explaining Neural Networks via   Conformal Prediction

Wei Qian; Chenxu Zhao; Yangyi Li; Fenglong Ma; Chao Zhang; Mengdi Huai

arXiv:2401.01549·cs.LG·January 4, 2024·1 cites

Towards Modeling Uncertainties of Self-explaining Neural Networks via Conformal Prediction

Wei Qian, Chenxu Zhao, Yangyi Li, Fenglong Ma, Chao Zhang, Mengdi Huai

PDF

Open Access

TL;DR

This paper introduces a novel uncertainty modeling framework for self-explaining neural networks that provides distribution-free uncertainty quantification for explanations and predictions, linking confidence levels across both components.

Contribution

It proposes a new framework that unifies uncertainty quantification for both predictions and explanations in self-explaining neural networks, addressing limitations of existing methods.

Findings

01

Strong distribution-free uncertainty modeling performance

02

Effective prediction set generation based on explanations

03

Theoretical analysis supports framework validity

Abstract

Despite the recent progress in deep neural networks (DNNs), it remains challenging to explain the predictions made by DNNs. Existing explanation methods for DNNs mainly focus on post-hoc explanations where another explanatory model is employed to provide explanations. The fact that post-hoc methods can fail to reveal the actual original reasoning process of DNNs raises the need to build DNNs with built-in interpretability. Motivated by this, many self-explaining neural networks have been proposed to generate not only accurate predictions but also clear and intuitive insights into why a particular decision was made. However, existing self-explaining networks are limited in providing distribution-free uncertainty quantification for the two simultaneously generated prediction outcomes (i.e., a sample's final prediction and its corresponding explanations for interpreting that prediction).…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Materials Science

MethodsFocus