Teaching Language Models to Self-Improve by Learning from Language   Feedback

Chi Hu; Yimin Hu; Hang Cao; Tong Xiao; Jingbo Zhu

arXiv:2406.07168·cs.CL·June 12, 2024

Teaching Language Models to Self-Improve by Learning from Language Feedback

Chi Hu, Yimin Hu, Hang Cao, Tong Xiao, Jingbo Zhu

PDF

Open Access

TL;DR

This paper introduces Self-Refinement Tuning (SRT), a novel method enabling language models to self-improve by learning from language-based feedback, reducing dependence on costly human annotations and enhancing alignment with human intentions.

Contribution

The paper proposes SRT, a new self-improvement approach where models critique and refine their outputs using language feedback, outperforming existing baselines on multiple tasks.

Findings

01

SRT significantly improves model performance on diverse tasks.

02

SRT increases win rate from 9.6% to 25.8% on AlpacaEval 2.0.

03

Language feedback is crucial for effective self-improvement.

Abstract

Aligning Large Language Models (LLMs) with human intentions and values is crucial yet challenging. Current methods primarily rely on human preferences, which are costly and insufficient in capturing nuanced feedback expressed in natural language. In this paper, we present Self-Refinement Tuning (SRT), a method that leverages model feedback for alignment, thereby reducing reliance on human annotations. SRT uses a base language model (e.g., Tulu2) to generate initial responses, which are critiqued and refined by a more advanced model (e.g., GPT-4-Turbo). This process enables the base model to self-evaluate and improve its outputs, facilitating continuous learning. SRT further optimizes the model by learning from its self-generated feedback and refinements, creating a feedback loop that promotes model improvement. Our empirical evaluations demonstrate that SRT significantly outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Intelligent Tutoring Systems and Adaptive Learning · Topic Modeling

MethodsBalanced Selection