Number Representations in LLMs: A Computational Parallel to Human Perception
H.V. AlquBoj, Hilal AlQuabeh, Velibor Bojkovic, Tatsuya Hiraoka, Ahmed, Oumar El-Shangiti, Munachiso Nwadike, Kentaro Inui

TL;DR
This paper investigates whether large language models encode numerical information in a logarithmic, human-like manner, revealing that their internal representations exhibit sublinear spacing similar to human perception.
Contribution
The study demonstrates that LLMs encode numbers in a compressed, non-uniform way, aligning with the logarithmic mental number line hypothesis from cognitive science.
Findings
LLMs show sublinear spacing in numerical representations
Numerical embeddings align with a logarithmic scale
Models encode numbers in a human-like perceptual manner
Abstract
Humans are believed to perceive numbers on a logarithmic mental number line, where smaller values are represented with greater resolution than larger ones. This cognitive bias, supported by neuroscience and behavioral studies, suggests that numerical magnitudes are processed in a sublinear fashion rather than on a uniform linear scale. Inspired by this hypothesis, we investigate whether large language models (LLMs) exhibit a similar logarithmic-like structure in their internal numerical representations. By analyzing how numerical values are encoded across different layers of LLMs, we apply dimensionality reduction techniques such as PCA and PLS followed by geometric regression to uncover latent structures in the learned embeddings. Our findings reveal that the model's numerical representations exhibit sublinear spacing, with distances between values aligning with a logarithmic scale.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Mathematics, Computing, and Information Processing
MethodsPrincipal Components Analysis
