A Unified Representation Underlying the Judgment of Large Language Models

Yi-Long Lu; Jiajun Song; Wei Wang

arXiv:2510.27328·cs.CL·November 5, 2025

A Unified Representation Underlying the Judgment of Large Language Models

Yi-Long Lu, Jiajun Song, Wei Wang

PDF

Open Access

TL;DR

This paper uncovers a unified evaluative architecture in Large Language Models, showing that diverse judgments are encoded along a dominant Valence-Assent Axis that influences reasoning and can lead to biases like hallucinations.

Contribution

It provides evidence for a convergent, domain-general evaluative mechanism in LLMs centered on the Valence-Assent Axis, linking judgment and reasoning control.

Findings

01

Diverse evaluative judgments are encoded along the Valence-Assent Axis.

02

The VAA influences the generative process, affecting reasoning and factual accuracy.

03

Interventions on the VAA demonstrate its role as a control signal in judgment.

Abstract

A central architectural question for both biological and artificial intelligence is whether judgment relies on specialized modules or a unified, domain-general resource. While the discovery of decodable neural representations for distinct concepts in Large Language Models (LLMs) has suggested a modular architecture, whether these representations are truly independent systems remains an open question. Here we provide evidence for a convergent architecture for evaluative judgment. Across a range of LLMs, we find that diverse evaluative judgments are computed along a dominant dimension, which we term the Valence-Assent Axis (VAA). This axis jointly encodes subjective valence ("what is good") and the model's assent to factual claims ("what is true"). Through direct interventions, we demonstrate this axis drives a critical mechanism, which is identified as the subordination of reasoning: the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Language and cultural evolution