Understanding Inter-Concept Relationships in Concept-Based Models

Naveen Raman; Mateo Espinosa Zarlenga; and Mateja Jamnik

arXiv:2405.18217·cs.LG·May 29, 2024

Understanding Inter-Concept Relationships in Concept-Based Models

Naveen Raman, Mateo Espinosa Zarlenga, and Mateja Jamnik

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether concept-based models accurately capture inter-concept relationships, revealing their limitations in stability and robustness, and proposes a new algorithm to leverage these relationships for improved interpretability and downstream task performance.

Contribution

The paper provides an empirical analysis of inter-concept relationship capture in concept-based models and introduces a novel algorithm to enhance their effectiveness by leveraging these relationships.

Findings

01

State-of-the-art models lack stability and robustness in concept representations.

02

Current models fail to effectively capture inter-concept relationships.

03

A new algorithm improves concept intervention accuracy by leveraging inter-concept relationships.

Abstract

Concept-based explainability methods provide insight into deep learning systems by constructing explanations using human-understandable concepts. While the literature on human reasoning demonstrates that we exploit relationships between concepts when solving tasks, it is unclear whether concept-based methods incorporate the rich structure of inter-concept relationships. We analyse the concept representations learnt by concept-based models to understand whether these models correctly capture inter-concept relationships. First, we empirically demonstrate that state-of-the-art concept-based models produce representations that lack stability and robustness, and such methods fail to capture inter-concept relationships. Then, we develop a novel algorithm which leverages inter-concept relationships to improve concept intervention accuracy, demonstrating how correctly capturing inter-concept…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

naveenr414/Concept-Learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies