Interpreting Multi-Attribute Confounding through Numerical Attributes in Large Language Models

Hirohane Takagi; Gouki Minegishi; Shota Kizawa; Issey Sukeda; Hitomi Yanaka

arXiv:2511.04053·cs.AI·November 11, 2025

Interpreting Multi-Attribute Confounding through Numerical Attributes in Large Language Models

Hirohane Takagi, Gouki Minegishi, Shota Kizawa, Issey Sukeda, Hitomi Yanaka

PDF

Open Access

TL;DR

This paper investigates how large language models encode and are affected by multiple numerical attributes, revealing vulnerabilities in their decision-making processes due to entangled representations and irrelevant contextual influences.

Contribution

It introduces a novel analysis combining linear probing and correlation analysis to understand multi-attribute numerical encoding in LLMs, highlighting systematic amplification and context sensitivity.

Findings

01

LLMs encode real-world numerical correlations.

02

Irrelevant numerical context causes shifts in representations.

03

Downstream effects vary with model size.

Abstract

Although behavioral studies have documented numerical reasoning errors in large language models (LLMs), the underlying representational mechanisms remain unclear. We hypothesize that numerical attributes occupy shared latent subspaces and investigate two questions:(1) How do LLMs internally integrate multiple numerical attributes of a single entity? (2)How does irrelevant numerical context perturb these representations and their downstream outputs? To address these questions, we combine linear probing with partial correlation analysis and prompt-based vulnerability tests across models of varying sizes. Our results show that LLMs encode real-world numerical correlations but tend to systematically amplify them. Moreover, irrelevant context induces consistent shifts in magnitude representations, with downstream effects that vary by model size. These findings reveal a vulnerability in LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Ethics and Social Impacts of AI