On Representational Dissociation of Language and Arithmetic in Large Language Models
Riku Kisako, Tatsuki Kuribayashi, Ryohei Sasano

TL;DR
This paper investigates whether large language models encode arithmetic reasoning separately from general language, finding that simple arithmetic and language are represented in distinct, geometrically separable regions within the models.
Contribution
It provides the first analysis of the geometric separation of arithmetic and language representations in LLMs, aligning with neuroscientific findings about human brain activity.
Findings
Arithmetic and language are encoded in separate regions in LLMs.
Linear classifiers can distinguish between arithmetic and language representations.
Arithmetic representations show distinct geometric properties in the model's space.
Abstract
The association between language and (non-linguistic) thinking ability in humans has long been debated, and recently, neuroscientific evidence of brain activity patterns has been considered. Such a scientific context naturally raises an interdisciplinary question -- what about such a language-thought dissociation in large language models (LLMs)? In this paper, as an initial foray, we explore this question by focusing on simple arithmetic skills (e.g., ?) as a thinking ability and analyzing the geometry of their encoding in LLMs' representation space. Our experiments with linear classifiers and cluster separability tests demonstrate that simple arithmetic equations and general language input are encoded in completely separated regions in LLMs' internal representation space across all the layers, which is also supported with more controlled stimuli (e.g., spelled-out equations).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Neural Networks and Applications · Topic Modeling
