The Effect of Model Size on LLM Post-hoc Explainability via LIME

Henning Heyen; Amy Widdicombe; Noah Y. Siegel; Maria Perez-Ortiz; Philip Treleaven

arXiv:2405.05348 · cs.CL · May 10, 2024 · 1 citation

TL;DR

This study investigates how increasing the size of large language models affects the quality of LIME explanations, revealing that larger models do not necessarily produce more plausible explanations despite better performance.

Contribution

It provides the first systematic analysis of the relationship between model size and LIME explanation quality across different NLP tasks.

Findings

01 Larger models do not yield more plausible explanations.

02 Model size correlates with improved performance but not explanation plausibility.

03 Faithfulness metrics may have limitations in NLI contexts.

Abstract

Large language models (LLMs) are becoming bigger to boost performance. However, little is known about how explainability is affected by this trend. This work explores LIME explanations for DeBERTaV3 models of four different sizes on natural language inference (NLI) and zero-shot classification (ZSC) tasks. We evaluate the explanations based on their faithfulness to the models' internal decision processes and their plausibility, i.e. their agreement with human explanations. The key finding is that increased model size does not correlate with plausibility despite improved model performance, suggesting a misalignment between the LIME explanations and the models' internal processes as model size increases. Our results further suggest limitations regarding faithfulness metrics in NLI contexts.
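To make the two evaluation axes in the abstract concrete, here is a minimal sketch of how plausibility and faithfulness could be scored for a single explanation. It is an illustration under stated assumptions, not the paper's evaluation code: plausibility is approximated by token-level F1 against a human rationale, faithfulness by a comprehensiveness-style probability drop when the top-ranked tokens are deleted, and the names `top_k_tokens`, `token_f1`, `comprehensiveness`, and `toy_predict_proba`, as well as the attribution weights and the toy classifier, are all hypothetical stand-ins for a real LIME output and a real DeBERTaV3 model.

```python
from typing import Callable, Dict, Sequence, Set


def top_k_tokens(weights: Dict[int, float], k: int) -> Set[int]:
    """Return the positions of the k tokens with the largest absolute attribution."""
    ranked = sorted(weights, key=lambda i: abs(weights[i]), reverse=True)
    return set(ranked[:k])


def token_f1(predicted: Set[int], human: Set[int]) -> float:
    """Plausibility proxy: F1 overlap between explanation tokens and a human rationale."""
    overlap = len(predicted & human)
    if overlap == 0:
        return 0.0
    precision = overlap / len(predicted)
    recall = overlap / len(human)
    return 2 * precision * recall / (precision + recall)


def comprehensiveness(
    predict_proba: Callable[[Sequence[str]], float],
    tokens: Sequence[str],
    important: Set[int],
) -> float:
    """Faithfulness proxy: drop in predicted probability after deleting the important tokens."""
    reduced = [t for i, t in enumerate(tokens) if i not in important]
    return predict_proba(tokens) - predict_proba(reduced)


def toy_predict_proba(tokens: Sequence[str]) -> float:
    """Hypothetical classifier: the score rises with the number of negation words."""
    hits = sum(t in {"not", "never"} for t in tokens)
    return min(1.0, 0.2 + 0.4 * hits)


# Assumed inputs: per-token attribution weights (as an explainer such as LIME
# might return) and a human-annotated rationale given as token positions.
tokens = ["the", "dog", "is", "not", "sleeping"]
attributions = {0: 0.01, 1: 0.05, 2: 0.02, 3: 0.80, 4: 0.30}
human_rationale = {3, 4}

selected = top_k_tokens(attributions, k=2)
plausibility = token_f1(selected, human_rationale)
faithfulness = comprehensiveness(toy_predict_proba, tokens, selected)
print(f"plausibility={plausibility:.2f} faithfulness={faithfulness:.2f}")
```

The two scores are computed from different references: plausibility compares the explanation with human annotations, while faithfulness compares it with the model's own behaviour. This is why one can change with model size while the other does not.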

Code & Models

Repositories

henningheyen/scalability-of-llm-posthoc-explanations
PyTorch · Official

Taxonomy

Topics: Lung Cancer Diagnosis and Treatment · Speech Recognition and Synthesis

Methods: Local Interpretable Model-Agnostic Explanations (LIME)