CIKQA: Learning Commonsense Inference with a Unified   Knowledge-in-the-loop QA Paradigm

Hongming Zhang; Yintong Huo; Yanai Elazar; Yangqiu Song; Yoav; Goldberg; Dan Roth

arXiv:2210.06246·cs.CL·October 13, 2022

CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm

Hongming Zhang, Yintong Huo, Yanai Elazar, Yangqiu Song, Yoav, Goldberg, Dan Roth

PDF

Open Access

TL;DR

This paper introduces CIKQA, a benchmark for evaluating models' ability to perform commonsense inference and knowledge sufficiency across diverse tasks using a unified QA format, emphasizing knowledge understanding and generalization.

Contribution

It proposes a novel benchmark that separates knowledge acquisition from inference, aligning tasks with knowledge bases and assessing models' inference and generalization capabilities.

Findings

01

Models can identify if knowledge is sufficient for tasks.

02

Unified QA format enables cross-task generalization evaluation.

03

Annotations reveal gaps in models' commonsense inference.

Abstract

Recently, the community has achieved substantial progress on many commonsense reasoning benchmarks. However, it is still unclear what is learned from the training process: the knowledge, inference capability, or both? We argue that due to the large scale of commonsense knowledge, it is infeasible to annotate a large enough training set for each task to cover all commonsense for learning. Thus we should separate the commonsense knowledge acquisition and inference over commonsense knowledge as two separate tasks. In this work, we focus on investigating models' commonsense inference capabilities from two perspectives: (1) Whether models can know if the knowledge they have is enough to solve the task; (2) Whether models can develop commonsense inference capabilities that generalize across commonsense tasks. We first align commonsense tasks with relevant knowledge from commonsense knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsALIGN