Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation
Iulian Vlad Serban, Tim Klinger, Gerald Tesauro, Kartik Talamadupula,, Bowen Zhou, Yoshua Bengio, Aaron Courville

TL;DR
This paper presents a multiresolution RNN that models dialogue generation by capturing high-level discourse and low-level language details, leading to improved performance in technical and social conversation domains.
Contribution
The paper introduces a novel multiresolution RNN framework that jointly models high-level discourse and natural language tokens, enhancing dialogue response generation.
Findings
Outperforms existing methods on Ubuntu technical support data.
Generates more relevant and on-topic responses on Twitter.
Better captures long-term structure and overcomes language sparsity.
Abstract
We introduce the multiresolution recurrent neural network, which extends the sequence-to-sequence framework to model natural language generation as two parallel discrete stochastic processes: a sequence of high-level coarse tokens, and a sequence of natural language tokens. There are many ways to estimate or learn the high-level coarse tokens, but we argue that a simple extraction procedure is sufficient to capture a wealth of high-level discourse semantics. Such procedure allows training the multiresolution recurrent neural network by maximizing the exact joint log-likelihood over both sequences. In contrast to the standard log- likelihood objective w.r.t. natural language tokens (word perplexity), optimizing the joint log-likelihood biases the model towards modeling high-level abstractions. We apply the proposed model to the task of dialogue response generation in two challenging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
