Can Pretrained Language Models (Yet) Reason Deductively?
Zhangdie Yuan, Songbo Hu, Ivan Vuli\'c, Anna Korhonen, Zaiqiao Meng

TL;DR
This paper critically evaluates whether pretrained language models can reliably perform deductive reasoning, revealing significant limitations in their generalization, consistency, and knowledge retention, and highlighting the gap from human-level reasoning.
Contribution
The study provides a comprehensive evaluation of PLMs' deductive reasoning abilities, showing their inadequacies and the challenges in achieving reliable reasoning with current models.
Findings
PLMs inadequately generalize learned logic rules.
PLMs perform inconsistently against simple adversarial edits.
Fine-tuning improves reasoning over new facts but causes catastrophic forgetting.
Abstract
Acquiring factual knowledge with Pretrained Language Models (PLMs) has attracted increasing attention, showing promising performance in many knowledge-intensive tasks. Their good performance has led the community to believe that the models do possess a modicum of reasoning competence rather than merely memorising the knowledge. In this paper, we conduct a comprehensive evaluation of the learnable deductive (also known as explicit) reasoning capability of PLMs. Through a series of controlled experiments, we posit two main findings. (i) PLMs inadequately generalise learned logic rules and perform inconsistently against simple adversarial surface form edits. (ii) While the deductive reasoning fine-tuning of PLMs does improve their performance on reasoning over unseen knowledge facts, it results in catastrophically forgetting the previously learnt knowledge. Our main results suggest that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
