Towards Achieving Concept Completeness for Textual Concept Bottleneck Models
Milan Bhan, Yann Choho, Pierre Moreau, Jean-Noel Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot

TL;DR
This paper introduces CT-CBM, an unsupervised method for creating complete and interpretable concept bases in textual concept bottleneck models, improving interpretability and concept detection without human-labeled data.
Contribution
It presents a novel unsupervised approach to generate complete concept bases in TCBMs, reducing reliance on human annotations and enhancing interpretability.
Findings
Outperforms competitors in concept basis completeness
Achieves high concept detection accuracy
Eliminates need for human-labeled concepts
Abstract
Textual Concept Bottleneck Models (TCBMs) are interpretable-by-design models for text classification that predict a set of salient concepts before making the final prediction. This paper proposes Complete Textual Concept Bottleneck Model (CT-CBM), a novel TCBM generator building concept labels in a fully unsupervised manner using a small language model, eliminating both the need for predefined human labeled concepts and LLM annotations. CT-CBM iteratively targets and adds important and identifiable concepts in the bottleneck layer to create a complete concept basis. CT-CBM achieves striking results against competitors in terms of concept basis completeness and concept detection accuracy, offering a promising solution to reliably enhance interpretability of NLP classifiers.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Management and Algorithms · Bayesian Modeling and Causal Inference
MethodsSparse Evolutionary Training
