On the Factual Consistency of Text-based Explainable Recommendation Models

Ben Kabongo; Vincent Guigue

arXiv:2512.24366·cs.IR·May 18, 2026

On the Factual Consistency of Text-based Explainable Recommendation Models

Ben Kabongo, Vincent Guigue

PDF

1 Datasets

TL;DR

This paper introduces a framework to evaluate the factual consistency of text-based explainable recommendation models, revealing a significant gap between semantic similarity and factual accuracy.

Contribution

It proposes a prompting-based pipeline for extracting factual statements and new metrics for assessing explanation factuality, addressing a critical gap in current evaluation methods.

Findings

01

Models show high semantic similarity scores but low factuality accuracy.

02

The proposed metrics reveal a significant gap in explanation factuality.

03

Benchmark datasets and evaluation pipeline are provided for future research.

Abstract

Text-based explainable recommendation aims to generate natural-language explanations that justify item recommendations, to improve user trust and system transparency. Although recent advances leverage LLMs to produce fluent outputs, a critical question remains underexplored: are these explanations factually consistent with the available evidence? We introduce a comprehensive framework for evaluating the factual consistency of text-based explainable recommenders. We design a prompting-based pipeline that uses LLMs to extract atomic explanatory statements from reviews, thereby constructing a ground truth that isolates and focuses on their factual content. Applying this pipeline to five categories from the Amazon Reviews dataset, we create augmented benchmarks for fine-grained evaluation of explanation quality. We further propose statement-level alignment metrics that combine LLM- and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

benkabongo25/amazon-reviews-statement-v0
dataset· 212 dl
212 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Recommender Systems and Techniques · Artificial Intelligence in Healthcare and Education