Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression
Kai Yoshida, Masahiro Mizukami, Seiya Kawano, Canasai Kruengkrai,, Hiroaki Sugiyama, Koichiro Yoshino

TL;DR
This paper introduces a supervised fine-tuning approach using reward models based on LLMs to enhance dialogue system impressions like consistency and empathy, showing improved responses through automatic and human evaluations.
Contribution
It proposes a novel supervised fine-tuning method with reward models for evaluating and improving overall dialogue impressions, addressing challenges in dialogue-level evaluation.
Findings
Improved dialogue response naturalness after fine-tuning
Enhanced evaluation metrics for dialogue impressions
Better alignment with human judgments
Abstract
To improve user engagement during conversations with dialogue systems, we must improve individual dialogue responses and dialogue impressions such as consistency, personality, and empathy throughout the entire dialogue. While such dialogue systems have been developing rapidly with the help of large language models (LLMs), reinforcement learning from AI feedback (RLAIF) has attracted attention to align LLM-based dialogue models for such dialogue impressions. In RLAIF, a reward model based on another LLM is used to create a training signal for an LLM-based dialogue model using zero-shot/few-shot prompting techniques. However, evaluating an entire dialogue only by prompting LLMs is challenging. In this study, the supervised fine-tuning (SFT) of LLMs prepared reward models corresponding to 12 metrics related to the impression of the entire dialogue for evaluating dialogue responses. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Intelligent Tutoring Systems and Adaptive Learning · AI in Service Interactions
MethodsSoftmax · Attention Is All You Need · Reinforcement Learning from AI Feedback · ALIGN
