Linearly-Interpretable Concept Embedding Models for Text Analysis
Francesco De Santis, Philippe Bich, Gabriele Ciravegna, Pietro Barbiero, Danilo Giordano, Tania Cerquitelli

TL;DR
This paper introduces LICEM, a novel interpretable text classification model that surpasses previous interpretable models in accuracy, matches black-box models, and can be trained without concept supervision by leveraging LLM backbones.
Contribution
LICEM offers a linearly interpretable concept embedding approach that improves accuracy, enables causal explanations, and reduces annotation needs compared to existing concept-bottleneck models.
Findings
LICEM achieves higher accuracy than existing interpretable models.
LICEM's explanations are more interveneable and causally consistent.
LICEM can be trained without concept supervision using LLM backbones.
Abstract
Despite their success, Large-Language Models (LLMs) still face criticism due to their lack of interpretability. Traditional post-hoc interpretation methods, based on attention and gradient-based analysis, offer limited insights as they only approximate the model's decision-making processes and have been proved to be unreliable. For this reason, Concept-Bottleneck Models (CBMs) have been lately proposed in the textual field to provide interpretable predictions based on human-understandable concepts. However, CBMs still exhibit several limitations due to their architectural constraints limiting their expressivity, to the absence of task-interpretability when employing non-linear task predictors and for requiring extensive annotations that are impractical for real-world text data. In this paper, we address these challenges by proposing a novel Linearly Interpretable Concept Embedding Model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text and Document Classification Technologies
MethodsSoftmax · Attention Is All You Need
