Which Kind Is Better in Open-domain Multi-turn Dialog,Hierarchical or Non-hierarchical Models? An Empirical Study
Tian Lan, Xian-Ling Mao, Wei Wei, Heyan Huang

TL;DR
This paper systematically compares hierarchical and non-hierarchical models for open-domain multi-turn dialog generation, finding that most hierarchical models underperform except when enhanced with word-level attention, which significantly improves their effectiveness.
Contribution
The study provides a comprehensive empirical comparison of hierarchical and non-hierarchical models, revealing the impact of word-level attention mechanisms on model performance.
Findings
Most hierarchical models are worse than non-hierarchical models, except HRAN.
Incorporating word-level attention improves hierarchical models significantly.
Word-level attention enhances context understanding, especially for fine-grained information.
Abstract
Currently, open-domain generative dialog systems have attracted considerable attention in academia and industry. Despite the success of single-turn dialog generation, multi-turn dialog generation is still a big challenge. So far, there are two kinds of models for open-domain multi-turn dialog generation: hierarchical and non-hierarchical models. Recently, some works have shown that the hierarchical models are better than non-hierarchical models under their experimental settings; meanwhile, some works also demonstrate the opposite conclusion. Due to the lack of adequate comparisons, it's not clear which kind of models are better in open-domain multi-turn dialog generation. Thus, in this paper, we will measure systematically nearly all representative hierarchical and non-hierarchical models over the same experimental settings to check which kind is better. Through extensive experiments,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
