Read Quietly, Think Aloud: Decoupling Comprehension and Reasoning in LLMs
Yuanxin Wang, Ganesh Venkatesh

TL;DR
This paper explores methods to enable Large Language Models to perform internal silent reading and reasoning, improving their comprehension and response quality by mimicking human-like internal processing stages.
Contribution
It introduces simple yet effective techniques like contextual prompts and a 'reading buddy' architecture to enhance LLMs' internal processing capabilities.
Findings
Significant performance improvements with initial contextual prompts
Enhanced understanding through the 'reading buddy' architecture
Multiple point accuracy boosts observed
Abstract
Large Language Models (LLMs) have demonstrated remarkable proficiency in understanding text and generating high-quality responses. However, a critical distinction from human cognition is their typical lack of a distinct internal `reading' or deliberation phase before `speaking' (i.e., generating text). Humans often engage in silent reading to comprehend context and formulate thoughts prior to articulation. This paper investigates methods to imbue LLMs with a similar capacity for internal processing. We introduce and evaluate techniques that encourage LLMs to `read silently.' Our findings indicate that even a straightforward approach, such as providing the model with an initial contextual prompt or `reading space' before it begins predicting subsequent tokens for the final output, can yield significant performance improvements. We further enhance this concept by developing a `reading…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
