Measuring Systematic Generalization in Neural Proof Generation with   Transformers

Nicolas Gontier; Koustuv Sinha; Siva Reddy; Christopher Pal

arXiv:2009.14786·cs.LG·October 22, 2020·28 cites

Measuring Systematic Generalization in Neural Proof Generation with Transformers

Nicolas Gontier, Koustuv Sinha, Siva Reddy, Christopher Pal

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper investigates the systematic reasoning abilities of Transformer language models in natural language logical proofs, revealing strengths in backward-chaining and challenges with longer sequences, highlighting their internal reasoning strategies.

Contribution

It provides new insights into how TLMs generalize in logical reasoning tasks, especially comparing forward and backward chaining, and shows how training on longer proofs improves performance.

Findings

01

TLMs improve generalization after exposure to longer proofs

02

Backward-chaining proofs lead to better generalization than forward-chaining

03

Models not trained to generate proofs generalize better to longer problems

Abstract

We are interested in understanding how well Transformer language models (TLMs) can perform reasoning tasks when trained on knowledge encoded in the form of natural language. We investigate their systematic generalization abilities on a logical reasoning task in natural language, which involves reasoning over relationships between entities grounded in first-order logical proofs. Specifically, we perform soft theorem-proving by leveraging TLMs to generate natural language proofs. We test the generated proofs for logical consistency, along with the accuracy of the final inference. We observe length-generalization issues when evaluated on longer-than-trained sequences. However, we observe TLMs improve their generalization performance after being exposed to longer, exhaustive proofs. In addition, we discover that TLMs are able to generalize better using backward-chaining proofs compared to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Measuring Systematic Generalization in Neural Proof Generation with Transformers· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dense Connections · Dropout · Layer Normalization · Byte Pair Encoding · Label Smoothing · Multi-Head Attention · Attention Is All You Need