E-KAR: A Benchmark for Rationalizing Natural Language Analogical   Reasoning

Jiangjie Chen; Rui Xu; Ziquan Fu; Wei Shi; Zhongqiao Li; Xinbo Zhang,; Changzhi Sun; Lei Li; Yanghua Xiao; Hao Zhou

arXiv:2203.08480·cs.CL·October 27, 2022

E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning

Jiangjie Chen, Rui Xu, Ziquan Fu, Wei Shi, Zhongqiao Li, Xinbo Zhang,, Changzhi Sun, Lei Li, Yanghua Xiao, Hao Zhou

PDF

TL;DR

E-KAR is a new benchmark designed to evaluate and explain the reasoning process of neural models in solving knowledge-intensive analogical questions, highlighting current models' limitations.

Contribution

This paper introduces E-KAR, the first benchmark with explainability for analogical reasoning, combining Chinese and English questions with manual annotations.

Findings

01

Current models struggle with explanation generation.

02

Models perform poorly on analogical question answering.

03

Benchmark reveals gaps in neural reasoning capabilities.

Abstract

The ability to recognize analogies is fundamental to human cognition. Existing benchmarks to test word analogy do not reveal the underneath process of analogical reasoning of neural models. Holding the belief that models capable of reasoning should be right for the right reasons, we propose a first-of-its-kind Explainable Knowledge-intensive Analogical Reasoning benchmark (E-KAR). Our benchmark consists of 1,655 (in Chinese) and 1,251 (in English) problems sourced from the Civil Service Exams, which require intensive background knowledge to solve. More importantly, we design a free-text explanation scheme to explain whether an analogy should be drawn, and manually annotate them for each and every question and candidate answer. Empirical results suggest that this benchmark is very challenging for some state-of-the-art models for both explanation generation and analogical question…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methodstravel james