Expressive Explanations of DNNs by Combining Concept Analysis with ILP

Johannes Rabold; Gesina Schwalbe; Ute Schmid

arXiv:2105.07371·cs.LG·May 18, 2021

Expressive Explanations of DNNs by Combining Concept Analysis with ILP

Johannes Rabold, Gesina Schwalbe, Ute Schmid

PDF

1 Repo

TL;DR

This paper presents a method to generate expressive, symbolic explanations for deep neural networks in computer vision by combining concept analysis with inductive logic programming, enhancing transparency and interpretability.

Contribution

It introduces a novel approach that uses inherent network features and concept analysis to produce first-order rule explanations for DNNs, improving interpretability in vision tasks.

Findings

01

Explanations are faithful to the original model.

02

Semantic features are effectively mined from network activations.

03

First-order rules provide more expressive explanations.

Abstract

Explainable AI has emerged to be a key component for black-box machine learning approaches in domains with a high demand for reliability or transparency. Examples are medical assistant systems, and applications concerned with the General Data Protection Regulation of the European Union, which features transparency as a cornerstone. Such demands require the ability to audit the rationale behind a classifier's decision. While visualizations are the de facto standard of explanations, they come short in terms of expressiveness in many ways: They cannot distinguish between different attribute manifestations of visual features (e.g. eye open vs. closed), and they cannot accurately describe the influence of absence of, and relations between features. An alternative would be more expressive symbolic surrogate models. However, these require symbolic inputs, which are not readily available in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mc-lovin-mlem/concept-embeddings-and-ilp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.