Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling
Yiwen Ding, Zhiheng Xi, Wei He, Zhuoyuan Li, Yitao Zhai, Xiaowei Shi,, Xunliang Cai, Tao Gui, Qi Zhang, Xuanjing Huang

TL;DR
This paper introduces Guided Self-Improvement (GSI), a Socratic-guided sampling method that enhances the efficiency of LLM self-improvement by focusing on challenging queries, reducing computational costs, and overcoming tail distribution issues.
Contribution
The paper proposes GSI, a novel Socratic-guided sampling strategy that improves the efficiency of LLM self-improvement on complex queries, addressing tail narrowing without high computational costs.
Findings
GSI improves performance on diverse mathematical tasks.
GSI reduces computational overhead compared to brute-force sampling.
GSI maintains effectiveness on held-out tasks.
Abstract
Self-improvement methods enable large language models (LLMs) to generate solutions themselves and iteratively train on filtered, high-quality rationales. This process proves effective and reduces the reliance on human supervision in LLMs' reasoning, but the performance soon plateaus. We delve into the process and find that models tend to over-sample on easy queries and under-sample on queries they have yet to master. As iterations proceed, this imbalance in sampling is exacerbated, leading to a long-tail distribution where solutions to difficult queries almost diminish. This phenomenon limits the performance gain of self-improving models. A straightforward solution is brute-force sampling to balance the distribution, which significantly raises computational costs. In this paper, we introduce Guided Self-Improvement (GSI), a strategy aimed at improving the efficiency of sampling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReservoir Engineering and Simulation Methods
