Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory

Xin Cheng; Di Luo; Xiuying Chen; Lemao Liu; Dongyan Zhao; Rui Yan

arXiv:2305.02437 · cs.CL · December 27, 2023 · 21 cites

Open Access 1 Repo

TL;DR

This paper introduces selfmem, a novel retrieval-augmented generation framework that iteratively creates and uses its own output as memory, significantly improving text generation tasks like translation, summarization, and dialogue.

Contribution

The paper proposes a self-memory framework that leverages the model's own outputs for enhanced retrieval-augmented generation, addressing limitations of fixed memory sources.

Findings

01 Achieved state-of-the-art ROUGE scores on multiple datasets

02 Demonstrated effectiveness across translation, summarization, and dialogue tasks

03 Analyzed component contributions to identify bottlenecks

Abstract

With direct access to human-written references as memory, retrieval-augmented generation has achieved much progress in a wide range of text generation tasks. Since better memory would typically prompt better generation (we define this as the primal problem), the traditional approach to memory retrieval selects the memory that exhibits the highest similarity to the input. However, this method is constrained by the quality of the fixed corpus from which memory is retrieved. In this paper, we explore the dual of the primal problem, namely that better generation also prompts better memory, and propose a novel framework, selfmem, which addresses this limitation by iteratively employing a retrieval-augmented generator to create an unbounded memory pool and using a memory selector to choose one output as memory for the subsequent generation round. This enables the model to leverage its own output,…
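The iterative loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: `selfmem_loop`, `toy_generate`, and `toy_select` are hypothetical names, the generator and selector are toy stand-ins for trained models, and scoring candidates by word overlap with the source is an assumption made only so that the example runs.

```python
from typing import Callable, List


def selfmem_loop(
    source: str,
    retrieved_memory: str,
    generate_candidates: Callable[[str, str], List[str]],
    select_memory: Callable[[str, List[str]], str],
    rounds: int = 3,
) -> str:
    """Alternate between generating candidates and selecting one as memory.

    Round 0 starts from conventionally retrieved memory; each later round
    conditions the generator on the candidate chosen in the previous round.
    """
    memory = retrieved_memory
    for _ in range(rounds):
        # The generator produces a pool of candidates given source and memory.
        candidates = generate_candidates(source, memory)
        # The selector picks one candidate to serve as the next round's memory.
        memory = select_memory(source, candidates)
    return memory


def toy_generate(source: str, memory: str) -> List[str]:
    # Hypothetical stand-in for a retrieval-augmented generator: it proposes
    # the current memory and a variant extended by one missing source word.
    missing = [w for w in source.split() if w not in memory.split()]
    candidates = [memory]
    if missing:
        candidates.append((memory + " " + missing[0]).strip())
    return candidates


def toy_select(source: str, candidates: List[str]) -> str:
    # Hypothetical stand-in for a memory selector: it prefers the candidate
    # sharing the most distinct words with the source (an assumed toy metric).
    src = set(source.split())
    return max(candidates, key=lambda c: len(src & set(c.split())))


if __name__ == "__main__":
    print(selfmem_loop("the cat sat on the mat", "the cat", toy_generate, toy_select))
```

Passing the generator and selector as callables keeps the loop independent of any particular model, so either component could in principle be swapped out without touching the iteration logic.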

Code & Models

Repositories

hannibal046/selfmemory (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications