Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion   Dialogues via Reinforcement Learning and Human Demonstration

Weiyan Shi; Yu Li; Saurav Sahay; Zhou Yu

arXiv:2012.15375·cs.CL·October 25, 2022

Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human Demonstration

Weiyan Shi, Yu Li, Saurav Sahay, Zhou Yu

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach combined with human demonstration to improve persuasion dialogue systems by reducing repetition and inconsistency, leading to more persuasive and diverse conversations without relying on user simulators.

Contribution

The study proposes a novel RL-based refinement method that distills sentence-level rewards and learns from human demonstrations to enhance persuasion dialogue quality.

Findings

01

Outperforms previous models on automatic and human evaluations

02

Generates more diverse and consistent responses

03

Achieves higher persuasion success in donation tasks

Abstract

Persuasion dialogue systems reflect the machine's ability to make strategic moves beyond verbal communication, and therefore differentiate themselves from task-oriented or open-domain dialogue systems and have their own unique values. However, the repetition and inconsistency problems still persist in dialogue response generation and could substantially impact user experience and impede the persuasion outcome. Besides, although reinforcement learning (RL) approaches have achieved big success in strategic tasks such as games, they require a sophisticated user simulator to provide real-time feedback to the dialogue system, which limits the application of RL on persuasion dialogues. To address these issues towards a better persuasion dialogue system, we apply RL to refine a language model baseline without user simulators, and distill sentence-level information about repetition,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · AI in Service Interactions