Constructive Large Language Models Alignment with Diverse Feedback
Tianshu Yu, Ting-En Lin, Yuchuan Wu, Min Yang, Fei Huang, Yongbin Li

TL;DR
This paper introduces a novel alignment method for large language models that combines diverse feedback types inspired by constructivist learning, leading to improved performance with less training data across multiple tasks.
Contribution
The paper proposes Constructive and Diverse Feedback (CDF), a new approach that integrates critique, refinement, and preference feedback to enhance LLM alignment.
Findings
CDF outperforms previous methods in downstream tasks.
CDF achieves better alignment with less training data.
Experimental results validate the effectiveness of diversified feedback.
Abstract
In recent research on large language models (LLMs), there has been a growing emphasis on aligning these models with human values to reduce the impact of harmful content. However, current alignment methods often rely solely on singular forms of human feedback, such as preferences, annotated labels, or natural language critiques, overlooking the potential advantages of combining these feedback types. This limitation leads to suboptimal performance, even when ample training data is available. In this paper, we introduce Constructive and Diverse Feedback (CDF) as a novel method to enhance LLM alignment, inspired by constructivist learning theory. Our approach involves collecting three distinct types of feedback tailored to problems of varying difficulty levels within the training dataset. Specifically, we exploit critique feedback for easy problems, refinement feedback for medium problems,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
