Beyond Goldfish Memory: Long-Term Open-Domain Conversation

Jing Xu; Arthur Szlam; Jason Weston

arXiv:2107.07567·cs.CL·July 19, 2021·41 cites

Beyond Goldfish Memory: Long-Term Open-Domain Conversation

Jing Xu, Arthur Szlam, Jason Weston

PDF

Open Access

TL;DR

This paper introduces a new long-term open-domain conversation dataset and demonstrates that retrieval-augmented and summarization-based models significantly outperform traditional models in maintaining context over multiple sessions.

Contribution

The authors provide a novel dataset for long-term conversations and evaluate models showing the superiority of retrieval-augmented and summarization techniques.

Findings

01

Retrieval-augmented models outperform standard architectures.

02

Models with summarization and recall capabilities perform better.

03

Long-term context handling is crucial for realistic dialogue systems.

Abstract

Despite recent improvements in open-domain dialogue models, state of the art models are trained and evaluated on short conversations with little context. In contrast, the long-term conversation setting has hardly been studied. In this work we collect and release a human-human dataset consisting of multiple chat sessions whereby the speaking partners learn about each other's interests and discuss the things they have learnt from past sessions. We show how existing models trained on existing datasets perform poorly in this long-term conversation setting in both automatic and human evaluations, and we study long-context models that can perform much better. In particular, we find retrieval-augmented methods and methods with an ability to summarize and recall previous conversations outperform the standard encoder-decoder architectures currently considered state of the art.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications