Q-SENN: Quantized Self-Explaining Neural Networks

Thomas Norrenbrock; Marco Rudolph; Bodo Rosenhahn

arXiv:2312.13839·cs.CV·February 19, 2024·1 cites

Q-SENN: Quantized Self-Explaining Neural Networks

Thomas Norrenbrock, Marco Rudolph, Bodo Rosenhahn

PDF

Open Access 1 Repo

TL;DR

Q-SENN introduces a quantized, interpretable neural network that maintains high accuracy, improves explainability with binary class-feature relations, and aligns features with human concepts without extra supervision.

Contribution

The paper presents Q-SENN, a novel self-explaining neural network that enhances interpretability and applicability to complex datasets while preserving model accuracy.

Findings

01

Q-SENN outperforms previous models in interpretability and accuracy metrics.

02

It describes class-feature relationships as positive, negative, or neutral.

03

Features can be aligned with human concepts without additional supervision.

Abstract

Explanations in Computer Vision are often desired, but most Deep Neural Networks can only provide saliency maps with questionable faithfulness. Self-Explaining Neural Networks (SENN) extract interpretable concepts with fidelity, diversity, and grounding to combine them linearly for decision-making. While they can explain what was recognized, initial realizations lack accuracy and general applicability. We propose the Quantized-Self-Explaining Neural Network Q-SENN. Q-SENN satisfies or exceeds the desiderata of SENN while being applicable to more complex datasets and maintaining most or all of the accuracy of an uninterpretable baseline model, out-performing previous work in all considered metrics. Q-SENN describes the relationship between every class and feature as either positive, negative or neutral instead of an arbitrary number of possible relations, enforcing more binary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thomasnorr/q-senn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications