Hierarchical Recurrent Attention Network for Response Generation

Chen Xing; Wei Wu; Yu Wu; Ming Zhou; Yalou Huang; Wei-Ying Ma

arXiv:1701.07149 · cs.CL · January 26, 2017 · 116 cites

TL;DR

This paper introduces HRAN, a hierarchical recurrent attention network that improves multi-turn response generation in chatbots by focusing on important words and utterances within conversation context.

Contribution

The paper presents a novel hierarchical attention mechanism that jointly models word and utterance importance for better response generation.

Findings

01 HRAN outperforms existing models in automatic evaluation.

02 HRAN achieves higher human judgment scores.

03 The model effectively captures relevant context information.

Abstract

We study multi-turn response generation in chatbots where a response is generated according to a conversation context. Existing work has modeled the hierarchy of the context, but does not pay enough attention to the fact that words and utterances in the context are differentially important. As a result, they may lose important information in context and generate irrelevant responses. We propose a hierarchical recurrent attention network (HRAN) to model both aspects in a unified framework. In HRAN, a hierarchical attention mechanism attends to important parts within and among utterances with word level attention and utterance level attention respectively. With the word level attention, hidden vectors of a word level encoder are synthesized as utterance vectors and fed to an utterance level encoder to construct hidden representations of the context. The hidden vectors of the context are…
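
As a rough illustration of the two-level attention the abstract describes, the following Python sketch pools word hidden vectors into utterance vectors, then pools those into a single context vector. It is a simplified sketch under stated assumptions, not the authors' implementation: the dot-product scoring, the shared `query` vector (standing in for a decoder state), and the function names `attend` and `hierarchical_attention` are all hypothetical, and the utterance level encoder mentioned in the abstract is omitted for brevity.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def attend(vectors, query):
    """Score each vector against `query` and return (weights, weighted sum).

    Assumption: dot-product scoring is used here only for simplicity.
    """
    weights = softmax([dot(v, query) for v in vectors])
    dim = len(vectors[0])
    pooled = [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]
    return weights, pooled


def hierarchical_attention(context, query):
    """Apply word level attention, then utterance level attention.

    `context` is a list of utterances; each utterance is a list of
    word hidden vectors (hypothetical toy inputs, not real encoder states).
    """
    word_weights = []
    utterance_vectors = []
    for utterance in context:
        weights, utterance_vector = attend(utterance, query)
        word_weights.append(weights)
        utterance_vectors.append(utterance_vector)
    # Simplification: utterance vectors are attended to directly,
    # skipping the utterance level encoder.
    utt_weights, context_vector = attend(utterance_vectors, query)
    return word_weights, utt_weights, context_vector


# Toy context: two utterances with 2-dimensional word vectors.
context = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.5, 0.5], [2.0, 0.0], [0.0, 0.0]],
]
query = [1.0, 0.0]
word_weights, utt_weights, context_vector = hierarchical_attention(context, query)
```

In this toy setup, words whose vectors align with `query` receive larger word level weights, and utterances whose pooled vectors align with it receive larger utterance level weights.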

Code & Models

Repositories

LynetteXing1991/HRAN
none · Official

Taxonomy

Topics

Topic Modeling · Speech and dialogue systems · AI in Service Interactions