Propositional Logic for Probing Generalization in Neural Networks
Anna Langedijk, Jaap Jumelet, Willem Zuidema

TL;DR
This paper investigates how well neural networks like Transformers, GCNs, and LSTMs can generalize in propositional logic tasks, revealing significant challenges in learning systematic rule representations, especially with negation.
Contribution
It introduces a balanced dataset for testing neural models' generalization in propositional logic and evaluates the models' ability to handle unseen operator combinations.
Findings
Transformers struggle with negation generalization without structural biases.
All models perform well in-distribution but poorly on unseen patterns.
Standard architectures have limitations in learning systematic logical rules.
Abstract
The extent to which neural networks are able to acquire and represent symbolic rules remains a key topic of research and debate. Much current work focuses on the impressive capabilities of large language models, as well as their often ill-understood failures on a wide range of reasoning tasks. In this paper, in contrast, we investigate the generalization behavior of three key neural architectures (Transformers, Graph Convolution Networks and LSTMs) in a controlled task rooted in propositional logic. The task requires models to generate satisfying assignments for logical formulas, making it a structured and interpretable setting for studying compositionality. We introduce a balanced extension of an existing dataset to eliminate superficial patterns and enable testing on unseen operator combinations. Using this dataset, we evaluate the ability of the three architectures to generalize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Explainable Artificial Intelligence (XAI)
MethodsConvolution
