RECKONING: Reasoning through Dynamic Knowledge Encoding
Zeming Chen, Gail Weiss, Eric Mitchell, Asli Celikyilmaz, Antoine, Bosselut

TL;DR
RECKONING introduces a bi-level learning method that enables language models to encode contextual knowledge into their parameters, improving reasoning robustness, especially against distractors and longer reasoning chains.
Contribution
The paper proposes RECKONING, a novel bi-level training algorithm that teaches models to encode knowledge into parameters for more reliable reasoning.
Findings
Performance improved by up to 4.5% over baseline.
Better generalization to longer reasoning chains.
More robust to distractor facts.
Abstract
Recent studies on transformer-based language models show that they can answer questions by reasoning over knowledge provided as part of the context (i.e., in-context reasoning). However, since the available knowledge is often not filtered for a particular question, in-context reasoning can be sensitive to distractor facts, additional content that is irrelevant to a question but that may be relevant for a different question (i.e., not necessarily random noise). In these situations, the model fails to distinguish the knowledge that is necessary to answer the question, leading to spurious reasoning and degraded performance. This reasoning failure contrasts with the model's apparent ability to distinguish its contextual knowledge from all the knowledge it has memorized during pre-training. Following this observation, we propose teaching the model to reason more robustly by folding the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
