Investigating the interaction of linguistic and mathematical reasoning in language models using multilingual number puzzles
Antara Raaghavi Bhattacharya, Isabel Papadimitriou, Kathryn Davidson, David Alvarez-Melis

TL;DR
This paper explores why large language models struggle with multilingual number puzzles, revealing that explicit mathematical symbols are crucial and that models lack the implicit numeral structure understanding humans use.
Contribution
The study uncovers the importance of explicit symbols for mathematical reasoning in LLMs and highlights their inability to infer implicit numeral structures from language data.
Findings
Models require explicit mathematical symbols to solve number puzzles.
LLMs lack the ability to infer implicit numeral structures.
Humans use linguistic cues to understand number composition, unlike current models.
Abstract
Across languages, numeral systems vary widely in how they construct and combine numbers. While humans consistently learn to navigate this diversity, large language models (LLMs) struggle with linguistic-mathematical puzzles involving cross-linguistic numeral systems, which humans can learn to solve successfully. We investigate why this task is difficult for LLMs through a series of experiments that untangle the linguistic and mathematical aspects of numbers in language. Our experiments establish that models cannot consistently solve such problems unless the mathematical operations in the problems are explicitly marked using known symbols (, , etc., as in "twenty + three"). In further ablation studies, we probe how individual parameters of numeral construction and combination affect performance. While humans use their linguistic understanding of numbers to make inferences…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Second Language Acquisition and Learning · Text Readability and Simplification
