Unlocking the Potential of User Feedback: Leveraging Large Language   Model as User Simulator to Enhance Dialogue System

Zhiyuan Hu; Yue Feng; Anh Tuan Luu; Bryan Hooi; Aldo Lipani

arXiv:2306.09821·cs.CL·October 20, 2023·1 cites

Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System

Zhiyuan Hu, Yue Feng, Anh Tuan Luu, Bryan Hooi, Aldo Lipani

PDF

Open Access 1 Repo

TL;DR

This paper introduces UGRO, a novel method that leverages large language models as user simulators to improve task-oriented dialogue systems, achieving superior results by optimizing smaller models with LLM-generated feedback.

Contribution

The paper proposes UGRO, an innovative approach that combines LLMs as user simulators with smaller TOD models to enhance dialogue response quality.

Findings

01

UGRO outperforms previous state-of-the-art methods on two TOD benchmarks.

02

Using LLMs as user simulators effectively guides smaller models to generate higher satisfaction responses.

03

Empirical results validate the effectiveness of the proposed approach.

Abstract

Dialogue systems and large language models (LLMs) have gained considerable attention. However, the direct utilization of LLMs as task-oriented dialogue (TOD) models has been found to underperform compared to smaller task-specific models. Nonetheless, it is crucial to acknowledge the significant potential of LLMs and explore improved approaches for leveraging their impressive abilities. Motivated by the goal of leveraging LLMs, we propose an alternative approach called User-Guided Response Optimization (UGRO) to combine it with a smaller TOD model. This approach uses LLM as annotation-free user simulator to assess dialogue responses, combining them with smaller fine-tuned end-to-end TOD models. By utilizing the satisfaction feedback generated by LLMs, UGRO further optimizes the supervised fine-tuned TOD model. Specifically, the TOD model takes the dialogue history as input and, with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhiyuanhubj/ugro-cimk23
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning