Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability
Xin Lv, Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu, Yichi Zhang, Zelin, Dai

TL;DR
This paper introduces a framework and benchmark for evaluating the interpretability of multi-hop reasoning models, revealing current models' interpretability is limited and suggesting rule integration as a future direction.
Contribution
It proposes a unified quantitative evaluation framework and a benchmark dataset for assessing multi-hop reasoning interpretability, highlighting the gap between current models and rule-based approaches.
Findings
Current models have low interpretability scores.
Rule-based models outperform reasoning models in interpretability.
There is significant room for improving multi-hop reasoning interpretability.
Abstract
Multi-hop reasoning has been widely studied in recent years to obtain more interpretable link prediction. However, we find in experiments that many paths given by these models are actually unreasonable, while little works have been done on interpretability evaluation for them. In this paper, we propose a unified framework to quantitatively evaluate the interpretability of multi-hop reasoning models so as to advance their development. In specific, we define three metrics including path recall, local interpretability, and global interpretability for evaluation, and design an approximate strategy to calculate them using the interpretability scores of rules. Furthermore, we manually annotate all possible rules and establish a Benchmark to detect the Interpretability of Multi-hop Reasoning (BIMR). In experiments, we run nine baselines on our benchmark. The experimental results show that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Explainable Artificial Intelligence (XAI)
