Can Transformers Reason in Fragments of Natural Language?

Viktor Schlegel; Kamen V. Pavlov; Ian Pratt-Hartmann

arXiv:2211.05417·cs.CL·November 11, 2022

Can Transformers Reason in Fragments of Natural Language?

Viktor Schlegel, Kamen V. Pavlov, Ian Pratt-Hartmann

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper empirically investigates whether transformer models genuinely reason in natural language fragments, finding they overfit superficial patterns despite high performance, thus questioning their true reasoning capabilities.

Contribution

It provides a large-scale empirical analysis of transformer models' reasoning abilities in complex natural language fragments, highlighting their tendency to overfit superficial cues.

Findings

01

Transformers perform well on reasoning tasks in natural language fragments.

02

They tend to overfit superficial patterns rather than learn logical principles.

03

This raises questions about the true reasoning capabilities of current NLP models.

Abstract

State-of-the-art deep-learning-based approaches to Natural Language Processing (NLP) are credited with various capabilities that involve reasoning with natural language texts. In this paper we carry out a large-scale empirical study investigating the detection of formally valid inferences in controlled fragments of natural language for which the satisfiability problem becomes increasingly complex. We find that, while transformer-based language models perform surprisingly well in these scenarios, a deeper analysis re-veals that they appear to overfit to superficial patterns in the data rather than acquiring the logical principles governing the reasoning in these fragments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

schlevik/nlr
pytorchOfficial

Datasets

tasksource/natural-language-satisfiability
dataset· 16 dl
16 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)