Neural Response Generation with Dynamic Vocabularies
Yu Wu, Wei Wu, Dejian Yang, Can Xu, Zhoujun Li, Ming Zhou

TL;DR
This paper introduces DVS2S, a dynamic vocabulary model for chatbot response generation that improves response quality and efficiency by customizing vocabularies per input, reducing noise and decoding time.
Contribution
The paper proposes a novel dynamic vocabulary sequence-to-sequence model that jointly learns vocabulary construction and response generation, enhancing response relevance and decoding efficiency.
Findings
Outperforms state-of-the-art methods in response quality.
Reduces decoding time to 60% of the most efficient baseline.
Effectively filters generic patterns and irrelevant words.
Abstract
We study response generation for open domain conversation in chatbots. Existing methods assume that words in responses are generated from an identical vocabulary regardless of their inputs, which not only makes them vulnerable to generic patterns and irrelevant noise, but also causes a high cost in decoding. We propose a dynamic vocabulary sequence-to-sequence (DVS2S) model which allows each input to possess their own vocabulary in decoding. In training, vocabulary construction and response generation are jointly learned by maximizing a lower bound of the true objective with a Monte Carlo sampling method. In inference, the model dynamically allocates a small vocabulary for an input with the word prediction model, and conducts decoding only with the small vocabulary. Because of the dynamic vocabulary mechanism, DVS2S eludes many generic patterns and irrelevant words in generation, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · AI in Service Interactions · Speech and dialogue systems
