A Framework for Learning Ante-hoc Explainable Models via Concepts

Anirban Sarkar; Deepak Vijaykeerthy; Anindya Sarkar; Vineeth N; Balasubramanian

arXiv:2108.11761·cs.LG·December 2, 2021

A Framework for Learning Ante-hoc Explainable Models via Concepts

Anirban Sarkar, Deepak Vijaykeerthy, Anindya Sarkar, Vineeth N, Balasubramanian

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel self-explaining deep learning framework that jointly learns concepts and generates explanations, achieving high predictive accuracy and interpretability, especially on large-scale datasets like ImageNet.

Contribution

It proposes a flexible, joint training model for ante-hoc explainability that works with both supervised and unsupervised concepts, outperforming existing methods.

Findings

01

Achieves high predictive performance with concept supervision.

02

Effectively generates meaningful concept-based explanations.

03

Demonstrates scalability to large datasets like ImageNet.

Abstract

Self-explaining deep models are designed to learn the latent concept-based explanations implicitly during training, which eliminates the requirement of any post-hoc explanation generation technique. In this work, we propose one such model that appends an explanation generation module on top of any basic network and jointly trains the whole module that shows high predictive performance and generates meaningful explanations in terms of concepts. Our training strategy is suitable for unsupervised concept learning with much lesser parameter space requirements compared to baseline methods. Our proposed model also has provision for leveraging self-supervision on concepts to extract better explanations. However, with full concept supervision, we achieve the best predictive performance compared to recently proposed concept-based explainable models. We report both qualitative and quantitative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anirbansarkar-cs/Ante-hoc_Explainability_Concepts
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Advanced Neural Network Applications