CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models
Dongfang Li, Zetian Sun, Xinshuo Hu, Baotian Hu, Min Zhang

TL;DR
This paper introduces CMT, a memory compression method for LLMs that enables continual learning without retraining, by compressing new information into a memory bank to improve adaptability and knowledge retention.
Contribution
The paper proposes a novel online adaptation framework for LLMs that uses memory compression inspired by human memory, with new techniques for encoding, retrieval, and aggregation.
Findings
Improves model adaptability on continual learning datasets
Enhances robustness across multiple base LLMs
Achieves +4.07 EM and +4.19 F1 in StreamingQA with Llama-2-7b
Abstract
Large Language Models (LLMs) need to adapt to the continuous changes in data, tasks, and user preferences. Due to their massive size and the high costs associated with training, LLMs are not suitable for frequent retraining. However, updates are necessary to keep them in sync with rapidly evolving human knowledge. To address these challenges, this paper proposes the Compression Memory Training (CMT) method, an efficient and effective online adaptation framework for LLMs that features robust knowledge retention capabilities. Inspired by human memory mechanisms, CMT compresses and extracts information from new documents to be stored in a memory bank. When answering to queries related to these new documents, the model aggregates these document memories from the memory bank to better answer user questions. The parameters of the LLM itself do not change during training and inference,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsBalanced Selection
