TL;DR
This paper introduces Explingo, a system using Large Language Models to convert traditional ML explanations into human-readable narratives, with an evaluation framework and open-source implementation for improved explainability.
Contribution
It presents a novel LLM-based approach with Narrator and Grader modules for generating and evaluating narrative explanations of ML models.
Findings
LLMs can generate high-quality, human-readable explanations.
Narrative quality improves with few human-labeled examples.
Challenges remain in scoring explanations in complex domains.
Abstract
Explanations of machine learning (ML) model predictions generated by Explainable AI (XAI) techniques such as SHAP are essential for people using ML outputs for decision-making. We explore the potential of Large Language Models (LLMs) to transform these explanations into human-readable, narrative formats that align with natural communication. We address two key research questions: (1) Can LLMs reliably transform traditional explanations into high-quality narratives? and (2) How can we effectively evaluate the quality of narrative explanations? To answer these questions, we introduce Explingo, which consists of two LLM-based subsystems, a Narrator and Grader. The Narrator takes in ML explanations and transforms them into natural-language descriptions. The Grader scores these narratives on a set of metrics including accuracy, completeness, fluency, and conciseness. Our experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training · Shapley Additive Explanations · ALIGN
