Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances
Zekang Li, Jinchao Zhang, Zhengcong Fei, Yang Feng, Jie Zhou

TL;DR
This paper introduces DialoFlow, a novel dialogue model that captures dynamic information flow across utterances, significantly improving response quality and providing a new automatic evaluation metric for human-bot interactions.
Contribution
The paper proposes DialoFlow with a dynamic flow mechanism and training objectives to model information dynamics, advancing beyond the flat concatenation approach in dialogue modeling.
Findings
DialoFlow outperforms DialoGPT on Reddit and DailyDialog datasets.
Flow score correlates highly with human ratings ($r=0.9$).
The approach effectively models semantic influence across dialogue turns.
Abstract
Nowadays, open-domain dialogue models can generate acceptable responses according to the historical context based on the large-scale pre-trained language models. However, they generally concatenate the dialogue history directly as the model input to predict the response, which we named as the flat pattern and ignores the dynamic information flow across dialogue utterances. In this work, we propose the DialoFlow model, in which we introduce a dynamic flow mechanism to model the context flow, and design three training objectives to capture the information dynamics across dialogue utterances by addressing the semantic influence brought about by each utterance in large-scale pre-training. Experiments on the multi-reference Reddit Dataset and DailyDialog Dataset demonstrate that our DialoFlow significantly outperforms the DialoGPT on the dialogue generation task. Besides, we propose the Flow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
