System-Level Natural Language Feedback
Weizhe Yuan, Kyunghyun Cho, Jason Weston

TL;DR
This paper introduces a novel framework for utilizing natural language feedback at the system level to improve models, focusing on metric and prompt design, with case studies in search and dialogue systems demonstrating its effectiveness.
Contribution
It presents a new approach for system-level use of NL feedback, combining it with instance-level feedback for enhanced model refinement.
Findings
System-level feedback improves model performance in search and dialogue tasks.
Combining system-level and instance-level feedback yields further gains.
Human-written feedback results in more grounded refinements than GPT-3.5 generated feedback.
Abstract
Natural language (NL) feedback offers rich insights into user experience. While existing studies focus on an instance-level approach, where feedback is used to refine specific examples, we introduce a framework for system-level use of NL feedback. We show how to use feedback to formalize system-level design decisions in a human-in-the-loop-process -- in order to produce better models. In particular this is done through: (i) metric design for tasks; and (ii) language model prompt design for refining model responses. We conduct two case studies of this approach for improving search query and dialog response generation, demonstrating the effectiveness of system-level feedback. We show the combination of system-level and instance-level feedback brings further gains, and that human written instance-level feedback results in more grounded refinements than GPT-3.5 written ones, underlying the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems · Cognitive Computing and Networks · Neural Networks and Applications
MethodsAttention Is All You Need · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam · Byte Pair Encoding · Residual Connection · Weight Decay · Softmax · Dropout · Attention Dropout · Dense Connections
