Geometric Properties of the Voronoi Tessellation in Latent Semantic Manifolds of Large Language Models
Marshall Brett

TL;DR
This paper empirically analyzes the Voronoi tessellation in language model vector spaces, confirming a linear scaling law and demonstrating margin refinement procedures to reshape model geometry without retraining.
Contribution
It validates Mabrok's linear scaling law with high precision and introduces margin refinement procedures to reshape the tessellation, preserving model performance.
Findings
Confirmed Mabrok's linear scaling law with R^2 = 0.9997.
Demonstrated that margin refinement can reshape the tessellation without retraining.
Fisher information distance maximization achieves stable corrections with invariant downstream performance.
Abstract
Language models operate on discrete tokens but compute in continuous vector spaces, inducing a Voronoi tessellation over the representation manifold. We study this tessellation empirically on Qwen3.5-4B-Base, making two contributions. First, using float32 margin recomputation to resolve bfloat16 quantization artifacts, we validate Mabrok's (2026) linear scaling law of the expressibility gap with = 0.9997 - the strongest confirmation to date - and identify a mid-layer geometric ambiguity regime where margin geometry is anti-correlated with cross-entropy (layers 24-28, = -0.29) before crystallizing into alignment at the final layer ( = 0.836). Second, we show that the Voronoi tessellation of a converged model is reshapable through margin refinement procedures (MRP): short post-hoc optimization runs that widen token-decision margins without retraining. We compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
