SUMBot: Summarizing Context in Open-Domain Dialogue Systems
Rui Ribeiro, Lu\'isa Coheur

TL;DR
This paper introduces SUMBot, a method that enhances open-domain dialogue systems by replacing dialogue history with summaries, improving relevance and context retention within model input size constraints.
Contribution
The paper proposes a simple yet effective approach to incorporate summarized context in dialogue systems, addressing input size limitations and relevance issues.
Findings
Summarized context improves response quality.
Replacing dialogue history with summaries enhances relevance.
The method helps retain important information within token limits.
Abstract
In this paper, we investigate the problem of including relevant information as context in open-domain dialogue systems. Most models struggle to identify and incorporate important knowledge from dialogues and simply use the entire turns as context, which increases the size of the input fed to the model with unnecessary information. Additionally, due to the input size limitation of a few hundred tokens of large pre-trained models, regions of the history are not included and informative parts from the dialogue may be omitted. In order to surpass this problem, we introduce a simple method that substitutes part of the context with a summary instead of the whole history, which increases the ability of models to keep track of all the previous relevant information. We show that the inclusion of a summary may improve the answer generation task and discuss some examples to further understand the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsLinear Layer · Cosine Annealing · Residual Connection · Attention Dropout · Linear Warmup With Cosine Annealing · Dropout · Adam · Discriminative Fine-Tuning · Weight Decay · Byte Pair Encoding
