RECKONING: Reasoning through Dynamic Knowledge Encoding

Zeming Chen; Gail Weiss; Eric Mitchell; Asli Celikyilmaz; Antoine; Bosselut

arXiv:2305.06349·cs.CL·November 7, 2023·5 cites

RECKONING: Reasoning through Dynamic Knowledge Encoding

Zeming Chen, Gail Weiss, Eric Mitchell, Asli Celikyilmaz, Antoine, Bosselut

PDF

Open Access 1 Video

TL;DR

RECKONING introduces a bi-level learning method that enables language models to encode contextual knowledge into their parameters, improving reasoning robustness, especially against distractors and longer reasoning chains.

Contribution

The paper proposes RECKONING, a novel bi-level training algorithm that teaches models to encode knowledge into parameters for more reliable reasoning.

Findings

01

Performance improved by up to 4.5% over baseline.

02

Better generalization to longer reasoning chains.

03

More robust to distractor facts.

Abstract

Recent studies on transformer-based language models show that they can answer questions by reasoning over knowledge provided as part of the context (i.e., in-context reasoning). However, since the available knowledge is often not filtered for a particular question, in-context reasoning can be sensitive to distractor facts, additional content that is irrelevant to a question but that may be relevant for a different question (i.e., not necessarily random noise). In these situations, the model fails to distinguish the knowledge that is necessary to answer the question, leading to spurious reasoning and degraded performance. This reasoning failure contrasts with the model's apparent ability to distinguish its contextual knowledge from all the knowledge it has memorized during pre-training. Following this observation, we propose teaching the model to reason more robustly by folding the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

RECKONING: Reasoning through Dynamic Knowledge Encoding· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning