Using LLMs for Explaining Sets of Counterfactual Examples to Final Users
Arturo Fredes, Jordi Vitria

TL;DR
This paper introduces a multi-step pipeline leveraging large language models to generate natural language explanations from sets of counterfactual examples, aiding end-users in understanding model decisions in tabular data.
Contribution
It presents a novel approach that guides LLMs through smaller reasoning tasks to produce coherent explanations based on counterfactuals, improving interpretability in Explainable AI.
Findings
Promising results in generating explanations from counterfactuals
Proposed closed-loop evaluation for explanation coherence
Initial experiments show potential, further validation needed
Abstract
Causality is vital for understanding true cause-and-effect relationships between variables within predictive models, rather than relying on mere correlations, making it highly relevant in the field of Explainable AI. In an automated decision-making scenario, causal inference methods can analyze the underlying data-generation process, enabling explanations of a model's decision by manipulating features and creating counterfactual examples. These counterfactuals explore hypothetical scenarios where a minimal number of factors are altered, providing end-users with valuable information on how to change their situation. However, interpreting a set of multiple counterfactuals can be challenging for end-users who are not used to analyzing raw data records. In our work, we propose a novel multi-step pipeline that uses counterfactuals to generate natural language explanations of actions that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling · Data Quality and Management
MethodsSparse Evolutionary Training · Counterfactuals Explanations · Causal inference
