Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries

Jon-Paul Cacioli

arXiv:2603.28258·cs.CL·April 27, 2026

Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries

Jon-Paul Cacioli

PDF

TL;DR

This study reveals that large language models exhibit geometric warping in their hidden states at digit-count boundaries, akin to perceptual categorical perception, driven by structural input discontinuities.

Contribution

It demonstrates that structural input-format discontinuities alone can induce categorical perception geometry in LLMs, independent of semantic category knowledge.

Findings

01

Representational geometry fits a CP-additive model better than a continuous model.

02

The effect is specific to digit-count boundaries and absent at control positions.

03

Different models show either explicit categorization or mere geometric warping at boundaries.

Abstract

Categorical perception (CP) -- enhanced discriminability at category boundaries -- is among the most studied phenomena in perceptual psychology. This paper reports that analogous geometric warping occurs in the hidden-state representations of large language models (LLMs) processing Arabic numerals. Using representational similarity analysis across six models from five architecture families, the study finds that a CP-additive model (log-distance plus a boundary boost) fits the representational geometry better than a purely continuous model at 100% of primary layers in every model tested. The effect is specific to structurally defined boundaries (digit-count transitions at 10 and 100), absent at non-boundary control positions, and absent in the temperature domain where linguistic categories (hot/cold) lack a tokenisation discontinuity. Two qualitatively distinct signatures emerge:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.