Measuring and Mitigating Post-hoc Rationalization in Reverse Chain-of-Thought Generation

Guangyue Peng; Zongchao Chen; Wen Luo; Yuntao Wen; Wei Li; Ruixiang Feng; Ran Le; Chen Yang; Zhenwei An; Yang Song; Tao Zhang; Houfeng Wang

arXiv:2602.14469·cs.CL·February 17, 2026

Measuring and Mitigating Post-hoc Rationalization in Reverse Chain-of-Thought Generation

Guangyue Peng, Zongchao Chen, Wen Luo, Yuntao Wen, Wei Li, Ruixiang Feng, Ran Le, Chen Yang, Zhenwei An, Yang Song, Tao Zhang, Houfeng Wang

PDF

Open Access 2 Datasets

TL;DR

This paper investigates post-hoc rationalization in reverse chain-of-thought generation, formalizes its measurement, analyzes mitigation strategies, and proposes a structural skeleton-guided reasoning method that effectively reduces answer anchoring.

Contribution

It introduces a three-level measurement hierarchy for anchoring, analyzes why semantic suppression fails, and proposes SSR and SSR-D methods to mitigate answer dependence in reasoning traces.

Findings

01

Semantic suppression increases anchoring at entropic and probabilistic levels.

02

SSR reduces anchoring across all measurement levels.

03

SSR-D improves reasoning accuracy by up to 10% over suppression baselines.

Abstract

Reverse Chain-of-Thought Generation (RCG) synthesizes reasoning traces from query-answer pairs, but runs the risk of producing post-hoc rationalizations: when models can see the answer during generation, the answer serves as a cognitive anchor that shapes the entire explanation. We formalize this phenomenon through a three-level measurement hierarchy: lexical, entropic, and probabilistic anchoring, each captures surface artifacts, entropy dynamics, and latent answer dependence, respectively. We analyze semantic suppression, the intuitive mitigation strategy that instructs models to ignore the answer, to find out its counterproduction: while it reduces lexical overlap, it paradoxically increases entropic and probabilistic anchoring. Drawing on Ironic Process Theory from cognitive psychology, we attribute this failure to active monitoring of the forbidden answer, which inadvertently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · AI-based Problem Solving and Planning