Learning local discrete features in explainable-by-design convolutional   neural networks

Pantelis I. Kaplanoglou; Konstantinos Diamantaras

arXiv:2411.00139·cs.LG·November 4, 2024

Learning local discrete features in explainable-by-design convolutional neural networks

Pantelis I. Kaplanoglou, Konstantinos Diamantaras

PDF

Open Access 1 Repo

TL;DR

This paper introduces ExplaiNet, an explainable CNN framework that uses local discrete features and Bayesian networks to improve interpretability without sacrificing performance, demonstrated on image classification tasks.

Contribution

The paper presents a novel explainable CNN architecture that incorporates local discrete features and probabilistic graph explanations, enhancing interpretability while maintaining high accuracy.

Findings

01

Achieves state-of-the-art performance on MNIST with 0.75 million parameters.

02

Provides causal explanations through Bayesian network motifs.

03

Maintains performance comparable to baseline models on benchmark datasets.

Abstract

Our proposed framework attempts to break the trade-off between performance and explainability by introducing an explainable-by-design convolutional neural network (CNN) based on the lateral inhibition mechanism. The ExplaiNet model consists of the predictor, that is a high-accuracy CNN with residual or dense skip connections, and the explainer probabilistic graph that expresses the spatial interactions of the network neurons. The value on each graph node is a local discrete feature (LDF) vector, a patch descriptor that represents the indices of antagonistic neurons ordered by the strength of their activations, which are learned with gradient descent. Using LDFs as sequences we can increase the conciseness of explanations by repurposing EXTREME, an EM-based sequence motif discovery method that is typically used in molecular biology. Having a discrete feature motif matrix for each one of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pikaplan/LearnExplaiNet
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Data Classification