Hierarchical Text Generation and Planning for Strategic Dialogue

Denis Yarats; Mike Lewis

arXiv:1712.05846·cs.CL·June 6, 2018·29 cites

Hierarchical Text Generation and Planning for Strategic Dialogue

Denis Yarats, Mike Lewis

PDF

Open Access 1 Repo

TL;DR

This paper presents a hierarchical latent-variable model for goal-oriented dialogue that improves strategic planning, decision making, and linguistic quality by decoupling semantics from language realization, leading to better task success and more effective reinforcement learning.

Contribution

It introduces a novel hierarchical latent-variable framework that separates dialogue semantics from language, enhancing planning and learning in goal-oriented dialogue systems.

Findings

01

Increases end-task reward in dialogue models

02

Enhances long-term planning with rollouts

03

Enables self-play reinforcement learning without language divergence

Abstract

End-to-end models for goal-orientated dialogue are challenging to train, because linguistic and strategic aspects are entangled in latent state vectors. We introduce an approach to learning representations of messages in dialogues by maximizing the likelihood of subsequent sentences and actions, which decouples the semantics of the dialogue utterance from its linguistic realization. We then use these latent sentence representations for hierarchical language generation, planning and reinforcement learning. Experiments show that our approach increases the end-task reward achieved by the model, improves the effectiveness of long-term planning using rollouts, and allows self-play reinforcement learning to improve decision making without diverging from human language. Our hierarchical latent-variable model outperforms previous work both linguistically and strategically.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/end-to-end-negotiator
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques