Compositional Processing Emerges in Neural Networks Solving Math Problems
Jacob Russin, Roland Fernandez, Hamid Palangi, Eric Rosen, Nebojsa, Jojic, Paul Smolensky, Jianfeng Gao

TL;DR
This paper demonstrates that neural networks trained on math problems can learn and apply compositional rules, similar to human cognition, to infer structured relationships and compose meanings into complex solutions.
Contribution
It extends understanding of compositionality in neural networks from language to mathematical reasoning, showing networks can infer and utilize structured relationships.
Findings
Neural networks infer structured relationships in math data.
Networks can compose meanings based on learned rules.
Results suggest emergent compositional processing in neural models.
Abstract
A longstanding question in cognitive science concerns the learning mechanisms underlying compositionality in human cognition. Humans can infer the structured relationships (e.g., grammatical rules) implicit in their sensory observations (e.g., auditory speech), and use this knowledge to guide the composition of simpler meanings into complex wholes. Recent progress in artificial neural networks has shown that when large models are trained on enough linguistic data, grammatical structure emerges in their representations. We extend this work to the domain of mathematical reasoning, where it is possible to formulate precise hypotheses about how meanings (e.g., the quantities corresponding to numerals) should be composed according to structured rules (e.g., order of operations). Our work shows that neural networks are not only able to infer something about the structured relationships…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Neural Networks and Applications · Speech and dialogue systems
