Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction

Wei Shen; Xiaonan He; Chuheng Zhang; Xuyun Zhang; Xiaolong Xu; Wanchun Dou

arXiv:2505.18731·cs.LG·May 27, 2025

Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction

Wei Shen, Xiaonan He, Chuheng Zhang, Xuyun Zhang, Xiaolong Xu, Wanchun Dou

PDF

Open Access

TL;DR

This paper introduces auxiliary tasks to improve user satisfaction prediction in proactive dialogue agents, addressing noise and data sparsity issues for better interaction strategies.

Contribution

It proposes contrastive self-supervised learning and domain-intent classification tasks to enhance representation learning in reward-driven dialogue systems.

Findings

01

Improved accuracy in error recognition on rare utterances

02

Enhanced performance on long-tailed domain data

03

Significant gains over baseline models in DuerOS evaluation

Abstract

Reward-driven proactive dialogue agents require precise estimation of user satisfaction as an intrinsic reward signal to determine optimal interaction strategies. Specifically, this framework triggers clarification questions when detecting potential user dissatisfaction during interactions in the industrial dialogue system. Traditional works typically rely on training a neural network model based on weak labels which are generated by a simple model trained on user actions after current turn. However, existing methods suffer from two critical limitations in real-world scenarios: (1) Noisy Reward Supervision, dependence on weak labels derived from post-hoc user actions introduces bias, particularly failing to capture satisfaction signals in ASR-error-induced utterances; (2) Long-Tail Feedback Sparsity, the power-law distribution of user queries causes reward prediction accuracy to drop in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · AI in Service Interactions · Topic Modeling