Towards Achieving Concept Completeness for Textual Concept Bottleneck Models

Milan Bhan; Yann Choho; Pierre Moreau; Jean-Noel Vittaut; Nicolas Chesneau; Marie-Jeanne Lesot

arXiv:2502.11100·cs.CL·May 29, 2025

Towards Achieving Concept Completeness for Textual Concept Bottleneck Models

Milan Bhan, Yann Choho, Pierre Moreau, Jean-Noel Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot

PDF

Open Access 1 Video

TL;DR

This paper introduces CT-CBM, an unsupervised method for creating complete and interpretable concept bases in textual concept bottleneck models, improving interpretability and concept detection without human-labeled data.

Contribution

It presents a novel unsupervised approach to generate complete concept bases in TCBMs, reducing reliance on human annotations and enhancing interpretability.

Findings

01

Outperforms competitors in concept basis completeness

02

Achieves high concept detection accuracy

03

Eliminates need for human-labeled concepts

Abstract

Textual Concept Bottleneck Models (TCBMs) are interpretable-by-design models for text classification that predict a set of salient concepts before making the final prediction. This paper proposes Complete Textual Concept Bottleneck Model (CT-CBM), a novel TCBM generator building concept labels in a fully unsupervised manner using a small language model, eliminating both the need for predefined human labeled concepts and LLM annotations. CT-CBM iteratively targets and adds important and identifiable concepts in the bottleneck layer to create a complete concept basis. CT-CBM achieves striking results against competitors in terms of concept basis completeness and concept detection accuracy, offering a promising solution to reliably enhance interpretability of NLP classifiers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards Achieving Concept Completeness for Textual Concept Bottleneck Models· underline

Taxonomy

TopicsData Management and Algorithms · Bayesian Modeling and Causal Inference

MethodsSparse Evolutionary Training