Training Dialogue Systems by AI Feedback for Improving Overall Dialogue   Impression

Kai Yoshida; Masahiro Mizukami; Seiya Kawano; Canasai Kruengkrai,; Hiroaki Sugiyama; Koichiro Yoshino

arXiv:2501.12698·cs.CL·January 28, 2025

Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression

Kai Yoshida, Masahiro Mizukami, Seiya Kawano, Canasai Kruengkrai,, Hiroaki Sugiyama, Koichiro Yoshino

PDF

Open Access

TL;DR

This paper introduces a supervised fine-tuning approach using reward models based on LLMs to enhance dialogue system impressions like consistency and empathy, showing improved responses through automatic and human evaluations.

Contribution

It proposes a novel supervised fine-tuning method with reward models for evaluating and improving overall dialogue impressions, addressing challenges in dialogue-level evaluation.

Findings

01

Improved dialogue response naturalness after fine-tuning

02

Enhanced evaluation metrics for dialogue impressions

03

Better alignment with human judgments

Abstract

To improve user engagement during conversations with dialogue systems, we must improve individual dialogue responses and dialogue impressions such as consistency, personality, and empathy throughout the entire dialogue. While such dialogue systems have been developing rapidly with the help of large language models (LLMs), reinforcement learning from AI feedback (RLAIF) has attracted attention to align LLM-based dialogue models for such dialogue impressions. In RLAIF, a reward model based on another LLM is used to create a training signal for an LLM-based dialogue model using zero-shot/few-shot prompting techniques. However, evaluating an entire dialogue only by prompting LLMs is challenging. In this study, the supervised fine-tuning (SFT) of LLMs prepared reward models corresponding to 12 metrics related to the impression of the entire dialogue for evaluating dialogue responses. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Intelligent Tutoring Systems and Adaptive Learning · AI in Service Interactions

MethodsSoftmax · Attention Is All You Need · Reinforcement Learning from AI Feedback · ALIGN