LAP: An Attention-Based Module for Concept Based Self-Interpretation and Knowledge Injection in Convolutional Neural Networks

Rassa Ghavami Modegh; Ahmad Salimi; Alireza Dizaji; Hamid R. Rabiee

arXiv:2201.11808 · cs.CV · October 25, 2023

TL;DR

This paper introduces LAP, an attention-based pooling layer that makes CNNs self-interpretable and supports knowledge injection without performance loss. It can be plugged into already trained models and is verified on ImageNet.

Contribution

Proposes a pluggable attention-based pooling layer called LAP that provides self-interpretability and knowledge injection in CNNs without degrading accuracy.
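
To make the idea of attention-based pooling concrete, here is a minimal sketch in plain Python. It is a generic softmax-weighted local pooling, not the authors' LAP implementation: the function name `attention_pool_2d`, the non-overlapping windows, and the assumption that attention scores are passed in directly (rather than produced by a learned scoring layer) are all illustrative choices that do not come from the paper.

```python
import math


def attention_pool_2d(features, scores, window=2):
    """Pool each non-overlapping window x window patch by a softmax-weighted sum.

    features, scores: equally sized 2D lists (H x W) of floats.
    Hypothetical helper for illustration only; in a real network the scores
    would typically be produced by a learned layer rather than passed in.
    Returns (pooled, weights), where weights holds the per-patch softmax
    weight of every input cell and can be inspected as an attention map.
    """
    height, width = len(features), len(features[0])
    pooled = []
    weights = [[0.0] * width for _ in range(height)]
    for top in range(0, height - window + 1, window):
        pooled_row = []
        for left in range(0, width - window + 1, window):
            cells = [
                (r, c)
                for r in range(top, top + window)
                for c in range(left, left + window)
            ]
            # Subtract the patch maximum before exponentiating for stability.
            peak = max(scores[r][c] for r, c in cells)
            exps = [math.exp(scores[r][c] - peak) for r, c in cells]
            total = sum(exps)
            value = 0.0
            for (r, c), e in zip(cells, exps):
                weight = e / total
                weights[r][c] = weight
                value += weight * features[r][c]
            pooled_row.append(value)
        pooled.append(pooled_row)
    return pooled, weights
```

With equal scores in a patch, this reduces to average pooling; when one score dominates, the output approaches the feature value at that location, which is what makes the weight map readable as an indication of where the pooled value came from.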

Findings

01. LAP improves model interpretability over traditional explainers.

02. LAP can be integrated into existing CNNs, including trained models.

03. LAP maintains or improves classification performance.

Abstract

Despite the state-of-the-art performance of deep convolutional neural networks, they are susceptible to bias and malfunction in unseen situations. Moreover, the complex computation behind their reasoning is not understandable enough to humans to build trust. External explainer methods have tried to interpret network decisions in a human-understandable way, but they have been criticized for fallacies arising from their assumptions and simplifications. On the other hand, inherently self-interpretable models, while more robust to these fallacies, cannot be applied to already trained models. In this work, we propose a new attention-based pooling layer, called Local Attention Pooling (LAP), that accomplishes self-interpretability and the possibility for knowledge injection without performance loss. The module is easily pluggable into any convolutional neural network, even the already…

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications

Methods: Attention Pooling