DialoGen: Generalized Long-Range Context Representation for Dialogue Systems
Suvodip Dey, Maunendra Sankar Desarkar, Asif Ekbal, P.K. Srijith

TL;DR
DialoGen introduces a novel dialogue context representation framework that identifies and utilizes the most relevant historical utterances, enabling better long-range dependency modeling and compact dialogue history encoding.
Contribution
The paper presents DialoGen, a new encoder-decoder framework that improves dialogue understanding and generation by focusing on relevant past utterances beyond the last-$k$, enhancing long-range context modeling.
Findings
DialoGen performs comparably to state-of-the-art models on DailyDialog.
It achieves similar results on MultiWOZ for dialogue state tracking.
Relevance scores align well with human judgment.
Abstract
Long-range context modeling is crucial to both dialogue understanding and generation. The most popular method for dialogue context representation is to concatenate the last- utterances in chronological order. However, this method may not be ideal for conversations containing long-range dependencies, i.e., when there is a need to look beyond last- utterances to generate a meaningful response. In this work, we propose DialoGen, a novel encoder-decoder based framework for dialogue generation with a generalized context representation that can look beyond the last- utterances. The main idea of our approach is to identify and utilize the most relevant historical utterances instead of last-, which also enables the compact representation of dialogue history with fewer tokens. We study the effectiveness of our proposed method on both dialogue generation (open-domain) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsDynamic Sparse Training
