A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification

Marina Ribeiro (1, 2), Bárbara Malcorra (2), Natália B. Mota (2, 3), Rodrigo Wilkens (4, 5), Aline Villavicencio (5, 6), Lilian C. Hubner (7), César Rennó-Costa (1)

(1) Bioinformatics Multidisciplinary Environment (BioME), Digital Metropolis Institute (IMD), Federal University of Rio Grande do Norte (UFRN), Natal (RN), Brazil
(2) Research Department at Mobile Brain, Mobile Brain, Rio de Janeiro (RJ), Brazil
(3) Institute of Psychiatry (IPUB), Federal University of Rio de Janeiro (UFRJ), Rio de Janeiro (RJ), Brazil
(4) Department of Computer Science, The University of Exeter, Exeter, UK
(5) Institute for Data Science and Artificial Intelligence at the University of Exeter, Exeter, UK
(6) Department of Computer Science, The University of Sheffield, Sheffield, UK
(7) School of Humanities, Pontifical Catholic University of Rio Grande do Sul (PUCRS), Porto Alegre (RS), Brazil

arXiv:2410.00250 · cs.CL · October 2, 2024 · 2 cites

Open Access

TL;DR

This paper introduces SLIME, an explainability method for large language models such as BERT. It combines Integrated Gradients with linguistic analysis to identify and interpret the lexical features relevant to Alzheimer's detection in speech transcripts.

Contribution

The paper presents SLIME, a novel approach that combines Integrated Gradients with linguistic tools to explain LLM decisions in neurological speech analysis.
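As a rough illustration of the attribution step, Integrated Gradients scores each input feature by averaging the model's gradient along a straight path from a baseline to the actual input, then scaling by the difference between input and baseline. The sketch below applies this to a toy logistic classifier, not to BERT, and is not the authors' implementation: the weights `w`, bias `b`, input `x`, and the all-zeros baseline are hypothetical values chosen only to make the example runnable.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def model(x, w, b):
    """Toy classifier: probability of the positive class for input vector x."""
    return sigmoid(np.dot(w, x) + b)

def model_grad(x, w, b):
    """Analytic gradient of the toy model's output with respect to x."""
    p = model(x, w, b)
    return p * (1.0 - p) * w

def integrated_gradients(x, baseline, w, b, steps=200):
    """Approximate Integrated Gradients with a midpoint Riemann sum."""
    # Interpolation coefficients at the midpoints of `steps` equal intervals.
    alphas = (np.arange(steps) + 0.5) / steps
    # Gradient at each point on the straight path from baseline to x.
    grads = np.array(
        [model_grad(baseline + a * (x - baseline), w, b) for a in alphas]
    )
    # Average gradient, scaled by the input difference: one score per feature.
    return (x - baseline) * grads.mean(axis=0)

# Hypothetical weights and input, for illustration only.
w = np.array([1.5, -2.0, 0.5])
b = 0.1
x = np.array([1.0, 0.5, 2.0])
baseline = np.zeros_like(x)

attributions = integrated_gradients(x, baseline, w, b)

# Completeness check: attributions should sum to f(x) - f(baseline).
gap = attributions.sum() - (model(x, w, b) - model(baseline, w, b))
print(attributions, gap)
```

In this toy setting the second feature receives a negative attribution because its weight pushes the prediction toward the negative class. For a model like BERT, the same idea is typically applied to token embeddings, with the gradient obtained by automatic differentiation rather than a closed-form expression.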

Findings

01. BERT uses lexical features indicating reduced social references in AD.

02. SLIME effectively highlights features that improve model accuracy.

03. The method enhances interpretability of LLMs in clinical neurodegeneration studies.

Abstract

Neurological disorders that affect speech production, such as Alzheimer's Disease (AD), significantly impact the lives of both patients and caregivers, whether through social and psycho-emotional effects or through other aspects not yet fully understood. Recent advances in Large Language Model (LLM) architectures have produced many tools that identify representative features of neurological disorders in spontaneous speech. However, LLMs typically lack interpretability: they do not provide clear and specific reasons for their decisions. There is therefore a need for methods that can both identify the representative features of neurological disorders in speech and explain clearly why these features are relevant. This paper presents an explainable LLM method, named SLIME (Statistical and Linguistic Insights for Model Explanation), capable of identifying lexical components…

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques

Methods: Linear Layer · Softmax · Attention Dropout · Multi-Head Attention · Layer Normalization · Dense Connections · Attention Is All You Need · Adam · WordPiece · Linear Warmup With Linear Decay