Beyond Output Critique: Self-Correction via Task Distillation
Hossein A. Rahmani, Mengting Wan, Pei Zhou, Longqi Yang, Nick Craswell, Emine Yilmaz, Sujay Kumar Jauhar

TL;DR
The paper introduces SELF-THOUGHT, a framework that enhances LLM self-correction by using task abstraction to guide solution refinement, improving accuracy and robustness across models.
Contribution
It proposes a novel task distillation approach that transfers structured task templates from large to small models, enabling more reliable self-correction without extensive fine-tuning.
Findings
Improves accuracy and robustness in diverse reasoning tasks.
Enables small models to benefit from large model templates.
Reduces error propagation through task abstraction.
Abstract
Large language models (LLMs) have shown promising self-correction abilities, where iterative refinement improves the quality of generated responses. However, most existing approaches operate at the level of output critique, patching surface errors while often failing to correct deeper reasoning flaws. We propose SELF-THOUGHT, a framework that introduces an intermediate step of task abstraction before solution refinement. Given an input and an initial response, the model first distills the task into a structured template that captures key variables, constraints, and problem structure. This abstraction then guides solution instantiation, grounding subsequent responses in a clearer understanding of the task and reducing error propagation. Crucially, we show that these abstractions can be transferred across models: templates generated by larger models can serve as structured guides for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
