The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces

Ahmed Oumar El-Shangiti; Tatsuya Hiraoka; Hilal AlQuabeh; Benjamin Heinzerling; Kentaro Inui

arXiv:2410.13194·cs.CL·February 11, 2025

TL;DR

This study shows that large language models encode numerical attributes in low-dimensional linear subspaces of their hidden states, and that intervening in these subspaces changes the outcome of numeric comparisons.

Contribution

Using partial least squares regression, the paper identifies low-dimensional subspaces that encode numerical attributes in LLMs, then intervenes in those subspaces to demonstrate their causal role in numeric comparison.

Findings

01

LLMs encode numerical attributes in linear subspaces.

02

Intervening in these subspaces changes model outputs.

03

Results hold across three different LLMs and across different numerical attributes.

Abstract

This paper investigates whether large language models (LLMs) utilize numerical attributes encoded in a low-dimensional subspace of the embedding space when answering questions involving numeric comparisons, e.g., "Was Cristiano born before Messi?" Using partial least squares regression, we first identify subspaces that effectively encode the numerical attributes associated with the entities in comparison prompts. We then demonstrate causality by intervening in these subspaces to manipulate hidden states, thereby altering the LLM's comparison outcomes. Experiments on three different LLMs show that our results hold across different numerical attributes, indicating that LLMs utilize the linearly encoded information for numerical reasoning.
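The two steps in the abstract (find a subspace that encodes a numeric attribute, then shift hidden states along it) can be illustrated with a toy sketch. This is not the authors' code: the hidden states are synthetic, the subspace is one-dimensional, and the direction is the first weight vector of single-response PLS (proportional to the covariance between the centered states and the centered attribute) rather than a full multi-component PLS fit. The names `pls1_direction`, `intervene`, and `score`, as well as the dimensions and noise levels, are hypothetical choices made for illustration.

```python
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def center(rows):
    """Subtract each column's mean from every row; return (centered rows, means)."""
    n = len(rows)
    means = [sum(col) / n for col in zip(*rows)]
    return [[x - m for x, m in zip(row, means)] for row in rows], means


def pls1_direction(X, y):
    """First PLS weight vector: the unit vector proportional to Xc^T yc."""
    Xc, x_mean = center(X)
    y_mean = sum(y) / len(y)
    yc = [v - y_mean for v in y]
    w = [dot(col, yc) for col in zip(*Xc)]
    norm = dot(w, w) ** 0.5
    return [v / norm for v in w], x_mean


def intervene(h, w, delta):
    """Shift a hidden state by delta along the unit direction w."""
    return [a + delta * b for a, b in zip(h, w)]


# Synthetic stand-in for LLM hidden states (assumption): a standardized
# attribute such as birth year is written along one hidden direction, plus noise.
random.seed(0)
DIM = 8
true_dir = [random.gauss(0, 1) for _ in range(DIM)]
scale = dot(true_dir, true_dir) ** 0.5
true_dir = [v / scale for v in true_dir]

years = [random.uniform(-1, 1) for _ in range(200)]
states = [[3 * yr * d + random.gauss(0, 0.1) for d in true_dir] for yr in years]

w, x_mean = pls1_direction(states, years)


def score(h):
    """Coordinate of a hidden state along the learned direction."""
    return dot([a - m for a, m in zip(h, x_mean)], w)


# Take the entities with the earliest and latest attribute values.
early = states[years.index(min(years))]
late = states[years.index(max(years))]

# Push the "early" entity past the "late" one along the learned direction.
delta = score(late) - score(early) + 1.0
patched = intervene(early, w, delta)

print("before intervention, early < late:", score(early) < score(late))
print("after intervention, patched > late:", score(patched) > score(late))
```

In this toy setting the comparison is read directly off the projection, so flipping it is trivial. In an actual model the patched hidden state would be fed back into the forward pass, and the test is whether the generated answer changes.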

Taxonomy

Topics: Natural Language Processing Techniques · Model-Driven Software Engineering Techniques · Constraint Satisfaction and Optimization