Awakening LLMs' Reasoning Potential: A Fine-Grained Pipeline to Evaluate and Mitigate Vague Perception

Zipeng Ling; Yuehao Tang; Shuliang Liu; Junqi Yang; Shenghong Fu; Chen Huang; Kejia Huang; Yao Wan; Zhichao Hou; Xuming Hu

arXiv:2507.16199·cs.CL·January 7, 2026

Awakening LLMs' Reasoning Potential: A Fine-Grained Pipeline to Evaluate and Mitigate Vague Perception

Zipeng Ling, Yuehao Tang, Shuliang Liu, Junqi Yang, Shenghong Fu, Chen Huang, Kejia Huang, Yao Wan, Zhichao Hou, Xuming Hu

PDF

TL;DR

This paper introduces the WakenLLM pipeline to evaluate and enhance LLMs' reasoning by addressing Vague Perception, significantly improving their ability to correctly solve questions they previously abstained from, without additional training.

Contribution

The paper formalizes Vague Perception in LLMs and proposes a pipeline to measure and activate their latent reasoning potential through stage-wise metrics and stimulation techniques.

Findings

01

LLMs can improve up to 68.53% accuracy on Vague Perception samples using WakenLLM.

02

Existing reasoning baselines activate only a small portion of LLMs' potential.

03

Vague Perception varies across model families and sizes, informing model selection strategies.

Abstract

Large language models (LLMs) are increasingly trained to abstain on difficult questions by answering unknown. However, we observe that LLMs often misuse this option: they output unknown even when LLMs can actually solve the questions, or they fail to understand why questions are truly unsolvable. We formalize this mismatch between potential ability and the inclination of abstention as the Vague Perception phenomenon. We introduce the WakenLLM pipeline that (1) extracts Vague Perception samples and (2) measures how many of them can be converted to correct answers under stimulation. Based on stage-wise metrics (TCR, OCR, etc.) and the upper-bound accuracy Acc(WakenLLM), we quantify LLMs' reasoning potential beyond one-shot accuracy. Experiments on six LLMs suggest that, without further training or parameter revisions, LLMs can achieve up to a 68.53% increase in accuracy on Vague…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.