AnyCBMs: How to Turn Any Black Box into a Concept Bottleneck Model

Gabriele Dominici; Pietro Barbiero; Francesco Giannini; Martin; Gjoreski; Marc Langhenirich

arXiv:2405.16508·cs.LG·May 28, 2024

AnyCBMs: How to Turn Any Black Box into a Concept Bottleneck Model

Gabriele Dominici, Pietro Barbiero, Francesco Giannini, Martin, Gjoreski, Marc Langhenirich

PDF

Open Access

TL;DR

AnyCBMs is a novel method that converts existing trained neural networks into interpretable concept bottleneck models, enabling better understanding and intervention without retraining from scratch.

Contribution

The paper introduces AnyCBM, a technique to transform pre-trained models into concept bottleneck models with minimal additional training or resources.

Findings

01

Effective in maintaining classification performance

02

Improves interpretability through concept-based explanations

03

Enables interventions on downstream tasks

Abstract

Interpretable deep learning aims at developing neural architectures whose decision-making processes could be understood by their users. Among these techniqes, Concept Bottleneck Models enhance the interpretability of neural networks by integrating a layer of human-understandable concepts. These models, however, necessitate training a new model from the beginning, consuming significant resources and failing to utilize already trained large models. To address this issue, we introduce "AnyCBM", a method that transforms any existing trained model into a Concept Bottleneck Model with minimal impact on computational resources. We provide both theoretical and experimental insights showing the effectiveness of AnyCBMs in terms of classification performances and effectivenss of concept-based interventions on downstream tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBusiness Process Modeling and Analysis