Teaching Language Models to Self-Improve by Learning from Language Feedback
Chi Hu, Yimin Hu, Hang Cao, Tong Xiao, Jingbo Zhu

TL;DR
This paper introduces Self-Refinement Tuning (SRT), a novel method enabling language models to self-improve by learning from language-based feedback, reducing dependence on costly human annotations and enhancing alignment with human intentions.
Contribution
The paper proposes SRT, a new self-improvement approach where models critique and refine their outputs using language feedback, outperforming existing baselines on multiple tasks.
Findings
SRT significantly improves model performance on diverse tasks.
SRT increases win rate from 9.6% to 25.8% on AlpacaEval 2.0.
Language feedback is crucial for effective self-improvement.
Abstract
Aligning Large Language Models (LLMs) with human intentions and values is crucial yet challenging. Current methods primarily rely on human preferences, which are costly and insufficient in capturing nuanced feedback expressed in natural language. In this paper, we present Self-Refinement Tuning (SRT), a method that leverages model feedback for alignment, thereby reducing reliance on human annotations. SRT uses a base language model (e.g., Tulu2) to generate initial responses, which are critiqued and refined by a more advanced model (e.g., GPT-4-Turbo). This process enables the base model to self-evaluate and improve its outputs, facilitating continuous learning. SRT further optimizes the model by learning from its self-generated feedback and refinements, creating a feedback loop that promotes model improvement. Our empirical evaluations demonstrate that SRT significantly outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Intelligent Tutoring Systems and Adaptive Learning · Topic Modeling
MethodsBalanced Selection
