A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification
Marina Ribeiro (1, 2), B\'arbara Malcorra (2), Nat\'alia B. Mota (2, and 3), Rodrigo Wilkens (4, 5), Aline Villavicencio (5, 6) Lilian C., Hubner (7), C\'esar Renn\'o-Costa (1) ((1) Bioinformatics Multidisciplinary, Environment (BioME), Digital Metropolis Institute (IMD)

TL;DR
This paper introduces SLIME, an explainability method for large language models like BERT, which identifies and interprets lexical features relevant to Alzheimer's detection in speech transcripts, combining Integrated Gradients and linguistic analysis.
Contribution
The paper presents SLIME, a novel approach integrating Integrated Gradients and linguistic tools to explain LLM decisions in neurological speech analysis.
Findings
BERT uses lexical features indicating reduced social references in AD.
SLIME effectively highlights features that improve model accuracy.
The method enhances interpretability of LLMs in clinical neurodegeneration studies.
Abstract
Neurological disorders that affect speech production, such as Alzheimer's Disease (AD), significantly impact the lives of both patients and caregivers, whether through social, psycho-emotional effects or other aspects not yet fully understood. Recent advancements in Large Language Model (LLM) architectures have developed many tools to identify representative features of neurological disorders through spontaneous speech. However, LLMs typically lack interpretability, meaning they do not provide clear and specific reasons for their decisions. Therefore, there is a need for methods capable of identifying the representative features of neurological disorders in speech and explaining clearly why these features are relevant. This paper presents an explainable LLM method, named SLIME (Statistical and Linguistic Insights for Model Explanation), capable of identifying lexical components…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsLinear Layer · Softmax · Attention Dropout · Multi-Head Attention · Layer Normalization · Dense Connections · Attention Is All You Need · Adam · WordPiece · Linear Warmup With Linear Decay
