Linearly-Interpretable Concept Embedding Models for Text Analysis

Francesco De Santis; Philippe Bich; Gabriele Ciravegna; Pietro Barbiero; Danilo Giordano; Tania Cerquitelli

arXiv:2406.14335·cs.CL·July 17, 2025

Linearly-Interpretable Concept Embedding Models for Text Analysis

Francesco De Santis, Philippe Bich, Gabriele Ciravegna, Pietro Barbiero, Danilo Giordano, Tania Cerquitelli

PDF

Open Access

TL;DR

This paper introduces LICEM, a novel interpretable text classification model that surpasses previous interpretable models in accuracy, matches black-box models, and can be trained without concept supervision by leveraging LLM backbones.

Contribution

LICEM offers a linearly interpretable concept embedding approach that improves accuracy, enables causal explanations, and reduces annotation needs compared to existing concept-bottleneck models.

Findings

01

LICEM achieves higher accuracy than existing interpretable models.

02

LICEM's explanations are more interveneable and causally consistent.

03

LICEM can be trained without concept supervision using LLM backbones.

Abstract

Despite their success, Large-Language Models (LLMs) still face criticism due to their lack of interpretability. Traditional post-hoc interpretation methods, based on attention and gradient-based analysis, offer limited insights as they only approximate the model's decision-making processes and have been proved to be unreliable. For this reason, Concept-Bottleneck Models (CBMs) have been lately proposed in the textual field to provide interpretable predictions based on human-understandable concepts. However, CBMs still exhibit several limitations due to their architectural constraints limiting their expressivity, to the absence of task-interpretability when employing non-linear task predictors and for requiring extensive annotations that are impractical for real-world text data. In this paper, we address these challenges by proposing a novel Linearly Interpretable Concept Embedding Model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text and Document Classification Technologies

MethodsSoftmax · Attention Is All You Need