CMT: A Memory Compression Method for Continual Knowledge Learning of   Large Language Models

Dongfang Li; Zetian Sun; Xinshuo Hu; Baotian Hu; Min Zhang

arXiv:2412.07393·cs.CL·December 11, 2024

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Dongfang Li, Zetian Sun, Xinshuo Hu, Baotian Hu, Min Zhang

PDF

Open Access 1 Video

TL;DR

This paper introduces CMT, a memory compression method for LLMs that enables continual learning without retraining, by compressing new information into a memory bank to improve adaptability and knowledge retention.

Contribution

The paper proposes a novel online adaptation framework for LLMs that uses memory compression inspired by human memory, with new techniques for encoding, retrieval, and aggregation.

Findings

01

Improves model adaptability on continual learning datasets

02

Enhances robustness across multiple base LLMs

03

Achieves +4.07 EM and +4.19 F1 in StreamingQA with Llama-2-7b

Abstract

Large Language Models (LLMs) need to adapt to the continuous changes in data, tasks, and user preferences. Due to their massive size and the high costs associated with training, LLMs are not suitable for frequent retraining. However, updates are necessary to keep them in sync with rapidly evolving human knowledge. To address these challenges, this paper proposes the Compression Memory Training (CMT) method, an efficient and effective online adaptation framework for LLMs that features robust knowledge retention capabilities. Inspired by human memory mechanisms, CMT compresses and extracts information from new documents to be stored in a memory bank. When answering to queries related to these new documents, the model aggregates these document memories from the memory bank to better answer user questions. The parameters of the LLM itself do not change during training and inference,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsBalanced Selection