Attention with Intention for a Neural Network Conversation Model

Kaisheng Yao; Geoffrey Zweig; Baolin Peng

arXiv:1510.08565·cs.NE·November 6, 2015·97 cites

Attention with Intention for a Neural Network Conversation Model

Kaisheng Yao, Geoffrey Zweig, Baolin Peng

PDF

Open Access

TL;DR

This paper introduces a neural network conversation model that integrates attention and intention mechanisms through three recurrent networks, enabling more natural and context-aware responses without labeled data.

Contribution

It presents a novel neural architecture combining attention and intention modeling with end-to-end training for dialogue generation.

Findings

01

Generates more natural responses in conversation tasks.

02

Effectively models intention dynamics during dialogue.

03

Achieves improved response relevance without labeled data.

Abstract

In a conversation or a dialogue process, attention and intention play intrinsic roles. This paper proposes a neural network based approach that models the attention and intention processes. It essentially consists of three recurrent networks. The encoder network is a word-level model representing source side sentences. The intention network is a recurrent network that models the dynamics of the intention process. The decoder network is a recurrent network produces responses to the input from the source side. It is a language model that is dependent on the intention and has an attention mechanism to attend to particular source side words, when predicting a symbol in the response. The model is trained end-to-end without labeling data. Experiments show that this model generates natural responses to user inputs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques