Sandwich Reasoning: An Answer-Reasoning-Answer Approach for Low-Latency Query Correction
Chen Zhang, Kepu Zhang, Jiatong Zhang, Xiao Zhang, Jun Xu

TL;DR
This paper introduces Sandwich Reasoning, a novel approach for low-latency query correction that aligns fast initial answers with detailed reasoning, achieving state-of-the-art accuracy with significantly reduced latency.
Contribution
The paper proposes SandwichR, a new answer-reasoning-answer framework with a consistency-aware reinforcement learning strategy for effective low-latency query correction.
Findings
Achieves state-of-the-art accuracy in query correction.
Reduces latency by 40-70% compared to traditional methods.
Effectively aligns initial answers with reasoning for improved accuracy.
Abstract
Query correction is a critical entry point in modern search pipelines, demanding high accuracy strictly within real-time latency constraints. Chain-of-Thought (CoT) reasoning improves accuracy but incurs prohibitive latency for real-time query correction. A potential solution is to output an answer before reasoning to reduce latency; however, under autoregressive decoding, the early answer is independent of subsequent reasoning, preventing the model from leveraging its reasoning capability to improve accuracy. To address this issue, we propose Sandwich Reasoning (SandwichR), a novel approach that explicitly aligns a fast initial answer with post-hoc reasoning, enabling low-latency query correction without sacrificing reasoning-aware accuracy. SandwichR follows an Answer-Reasoning-Answer paradigm, producing an initial correction, an explicit reasoning process, and a final refined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Multimodal Machine Learning Applications · Mobile Crowdsensing and Crowdsourcing
