Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking   Reasoning Interpretability

Xin Lv; Yixin Cao; Lei Hou; Juanzi Li; Zhiyuan Liu; Yichi Zhang; Zelin; Dai

arXiv:2104.06751·cs.AI·September 10, 2021

Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability

Xin Lv, Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu, Yichi Zhang, Zelin, Dai

PDF

Open Access 1 Repo

TL;DR

This paper introduces a framework and benchmark for evaluating the interpretability of multi-hop reasoning models, revealing current models' interpretability is limited and suggesting rule integration as a future direction.

Contribution

It proposes a unified quantitative evaluation framework and a benchmark dataset for assessing multi-hop reasoning interpretability, highlighting the gap between current models and rule-based approaches.

Findings

01

Current models have low interpretability scores.

02

Rule-based models outperform reasoning models in interpretability.

03

There is significant room for improving multi-hop reasoning interpretability.

Abstract

Multi-hop reasoning has been widely studied in recent years to obtain more interpretable link prediction. However, we find in experiments that many paths given by these models are actually unreasonable, while little works have been done on interpretability evaluation for them. In this paper, we propose a unified framework to quantitatively evaluate the interpretability of multi-hop reasoning models so as to advance their development. In specific, we define three metrics including path recall, local interpretability, and global interpretability for evaluation, and design an approximate strategy to calculate them using the interpretability scores of rules. Furthermore, we manually annotate all possible rules and establish a Benchmark to detect the Interpretability of Multi-hop Reasoning (BIMR). In experiments, we run nine baselines on our benchmark. The experimental results show that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

THU-KEG/BIMR
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Explainable Artificial Intelligence (XAI)