How Do Answer Tokens Read Reasoning Traces? Self-Reading Patterns in Thinking LLMs for Quantitative Reasoning

Haoyang Chen; Yi Liu; Jianzhi Shao; Tao Zhang; Chengfu Huo; Wei Hu

arXiv:2604.19149·cs.CL·April 22, 2026

How Do Answer Tokens Read Reasoning Traces? Self-Reading Patterns in Thinking LLMs for Quantitative Reasoning

Haoyang Chen, Yi Liu, Jianzhi Shao, Tao Zhang, Chengfu Huo, Wei Hu

PDF

TL;DR

This paper investigates how large language models read and interpret reasoning traces during quantitative reasoning, revealing patterns linked to correctness and proposing a training-free steering method to improve accuracy.

Contribution

It uncovers self-reading patterns associated with correct reasoning and introduces a novel, training-free steering approach based on Self-Reading Quality scores to enhance model performance.

Findings

01

Benign self-reading patterns correlate with correct answers.

02

Incorrect solutions show diffuse and irregular attention.

03

The proposed SRQ-based steering improves inference accuracy.

Abstract

Thinking LLMs produce reasoning traces before answering. Prior activation steering work mainly targets on shaping these traces. It remains less understood how answer tokens actually read and integrate the reasoning to produce reliable outcomes. Focusing on quantitative reasoning, we analyze the answer-to-reasoning attention and observe a benign self-reading pattern aligned with correctness, characterized by a forward drift of the reading focus along the reasoning trace and a persistent concentration on key semantic anchors, whereas incorrect solutions exhibit diffuse and irregular attention pattern. We interpret this as internal certainty during answer decoding, where the model commits to a viable solution branch and integrates key evidence. Following this, we propose a training-free steering method driven by Self-Reading Quality (SRQ) scores combining geometric metrics for process…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.